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ENRICHMENT METHOD FOR VARIANT PROTEINS WITH ALTERED BINDING PROPERTIES 
reLd of THE INVENTION 

5 This invention relates to the preparation and systematic selection of novel binding proteins having 

altered binding properties for a target moiecule. Specifically, this invention relates to methods for producing 
foreign polypeptides mimicking the binding activity of naturally occurring binding partners. In preferred 
embodiments, the invention is directed to the preparation of therapeutic or diagnostic compounds that mimic 
proteins or nonpeptidyi molecules such a hormones, drugs and other smail moiecuies, particularly bioiogicaiiy active 
10 molecules such as growth hormone. 

BACKGROUND OF THE INVENTION 
Binding partners are substances that specifically bind to one another, usually through noncovalent 
interactions. Examples of binding partners include ligand-receptor, antibody-antigen, drug-target, and enzyme- 
substrate interactions. Binding partners are extremely useful in both therapeutic and diagnostic fields. 

1 5 Binding partners have been produced in the past by a variety of methods including; harvesting them 

from nature «e,g., antibody-antigen, and ligand-receptor pairings) and by adventitious identification (e.g. 
iradiiionai drug development employing random screening of candidate molecules). In some instances these two 
approaches have been combined. For example, variants of proteins or polypeptides, such as polypeptide 
fragments, have been made that contain key functional residues that participate in binding. These polypeptide 

2 0 fragments, in turn, have been derivaiized by methods akin io iradiiionai drug development An example of such 
derivitization would include strategies such as cyclization to conformationally constrain a polypeptide fragment to 
produce a novel candidate binding partner. 

The problem with prior art methods is thai naturally occurring ligands may not have proper 
characteristics for all therapeutic applications. Additionally, polypeptide ligands may not even be available for 

25 some target substances. Furthermore, methods for making non-naturally occurring synthetic binding partners 
are often expensive and difficult, usually requiring complex synthetic meSiods to produce each candidate. The 
inability to characterize the structure of the resulting candidate so that rational drug design methods can be 
applied for further optimization of candidate molecules further hampers these methods. 

in an attempt to overcome these probiems, Geysen (Geysen, immun. Today. 6:364-369 jl 985]) ; and 

3 0 (Geysen et a/., Mol. Immun. . 23:709-71 5 [1 986]) has proposed the use of polypeptide synthesis to provide a 
framework for systematic iterative binding partner identification and preparation. According to Geysen eta!., 
ibid, short polypeptides, such as dipeptides, are first screened for the ability to bind io a target moiecuie. The 
most active dipeptides are then selected for an additional round of testing comprising linking, to the starting 
dipeptide, an additional residue (or by internally modifying the components of the original starting dipeptide) and 

35 then screening this set of candidates for the desired activity. This process is reiterated until the binding partner 
having the desired properties is Identified. 

The Geysen ef si. method suffers from the disadvantage that the chemistry upon which it is based, 
peptide synthesis, produces molecules with ill-defined or variable secondary and tertiary structure. As rounds of 
iterative selection progress, random interactions accelerate among the various substituent groups of the 

40 polypeptide so that a true random population of interactive molecules having reproducible higher order structure 
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becomes less and less attainable. For example, interactions between side chains of amine acids, which are 
sequentially widely separated but which are spatially neighbors, freely occur. Furthermore, sequences that do not 
facilitate conformationally stable secondary structures provide complex peptide-sidechain interactions which may 
prevent sidechain interactions of a given amino acid with the target molecule. Such complex interactions are 
5 facilitated by the flexibility of the polyamide backbone of the polypeptide candidates. Additionally, candidates 
may exist in numerous conformations making it difficult to identify the conformer that interacts or binds to the 
target with greatest affinity or specificity complicating rational drug design. 

A final problem with the iterative polypeptide method of Geysen is that, at present, there are no 
practical methods with which a great diversity of different peptides can be produced, screened and analyzed. By 

1 o using the twenty naturally occurring amino acids, the total number of all combinations of hexapepSdes that must 
be synthesized is 64,000,000. Even having prepared such a diversity of peptides, there are no methods available 
with which mixtures of such a diversity of peptides can be rapidly screened to select those peptides having a high 
affinity for the target molecule. At present, each 'adherent- peptide must be recovered in amounts large enough 
to carry out protein sequencing. 

1 5 To overcome many of the problems inherent in the Geysen approach, biological selection and screening 

was chosen as an alternative. Biological selections and screens are powerful tools to probe protein function and to 
isolate variant proteins with desirable properties (Shortle, Protein Engineering. Oxender and Fox, eds., A.R. Liss, 
Inc., NY, pp. 103-108 [1988]) and Bowie et a'., Science. 247:1306-1310 [1990)]. However, a given selection or 
screen is applicable to only one or a smaii number of reiaied proieins. 

20 Recently, Smith and coworkers (Smith, Science, 228: 1315-1317 [1985]) and Parmley and Smith, Gene, 

73:305-318 [1985] have demonstrated that small protein fragments (10-50 amino adds) can be "displayed - 
efficiently on the surface of filamentous phage by inserting short gene fragments into gene III of the fd phage 
(■fusion phage"). The gene III minor coat protein (present in about 5 copies at one end of the virion) is important 
for proper phage assembly and for infection by attachment to the pili of £ coli (see Rasched etal. , Microbiol, 

25 Rev.. 50: 401-427 [1986]). Recently, fusion phage' have been shown io be useful for displaying short mutated 
peptide sequences for identifying peptides that may react with antibodies (Scott ef a/.. Science. 249: 386-390, 
[1390] land Cwirla etal FJSS Nat! Acad. U.S.A. 87: 6378-6382, [1990]).or a foreign protein (Devlin etsL 
Science . 249: 404-406 H9901). 

There are, however, several important limitations in using such "fusion phage" to identify altered 

30 peptides or proteins with new or enhanced binding properties. Hrsl.it has beenshown (Parmley e/a/.,fien£,73: 
305-318, [1988]) that fusion phage are useful only for displaying proteins of less than iOO and preferably iess 
than 50 amino acid residues, because large inserts presumably disrupt the function of gene III and therefore phage 
assembly and Meclfvfy. Second, prior art methods have been unable to select peptides from a library having the 
highest binding affinity for a target molecule. For example, after exhaustive panning of a random peptide library 

35 with an. antt-R endorphin monoclonal antibody, Cwirla and co-workers could not separate moderate affinity 

peptides (Kd ~ 10 (iM) from higher affinity peptides (Kd -0.4 u,M) fused to phage. Moreover, the parent R- 
endorphin peptide sequence which has very high affinity (Kd - 7nM), was not panned from the epitope library. 

LadnerWO 90/02802 discloses a method for selecting novel binding proteins displayed on the outer 
surface of cells and virai particles where it is contemplated thai the heterologous proteins may have up to 164 
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amino acid residues . The method contemplates isolating and amplifying the displayed proteins to engineer a new 
family of binding proteins having desired affinity for a target molecule. Mere specifically, Ladrser discloses a 
■fusion phage" displaying proteins having "initial protein binding domains" ranging from 46 residues (crambin) to 
164 residues (T4 lysozyme) fused to the M13 gene III coat protein. Ladner teaches the use of proteins "no larger 
5 than necessary" because it is easier to arrange restriction sites in smaller amino acid sequences and prefers the 58 
amino acid residue bovine pancreatic trypsin inhibitor (BPTI). Small fusion proteins, such as BPTI, are preferred 
when the target is a protein or macromolecule, while larger fusion proteins, such as T4 lysozyme, are preferred for 
smaii target molecules such as steroids because such large proteins have ciefis and grooves into which small 
molecules can fit. The preferred protein, BPTI, is proposed to be fused to gene III at the site disclosed by Smith 

1 0 et al. or de la Cruz et ai., J. Biol. Chem.. 263: 431 8-4322 [1 988], or to one of the terminii, along with a second 
synthetic copy of gene Hi so thai 'some" unaltered gene ill protein will be present. Ladner does not address the 
problem of successfully panning high affinity peptides from the random peptide library which plagues the 
biological selection and screening methods of the prior art. 

Human growth hormone (hGH) participates in much of the regulation of normai human growth and 

1 5 development. This 22,000 dalton pituitary hormone exhibits a multitude of biological effects including linear 
growth (somaiogenesis), lactation, activation of macrophages, insulin-like and diabetogenic effects among others 
(Chawla, R, K. (I983) Ann. Rev. Med. 34. 519; Edwards, C. K. et al. (1988) Science 239. 769; Thomer, M. 0., et ai. 
(1988) J. Clin. Invest. 81, 745). Growth hormone deficiency in children leads to dwarfism which has been 
successfully treated for more than a decade by exogenous administration of hGH. h.GH is a member of a family of 

2 0 homologous hormones thai include piaceniai iactogens, proiaciins, and other genetic and species varianis or growih 
hormone (Nicoll, C. S., etal., (1986) Endocrine Reviews 7. 169). hGH is unusual among these in that it exhibits broad 
species specificity and binds to either the cloned somatogenic (Leung, D. W., eta!., [1987] Nature 330. 537) or 
prolactin receptor (Boutin, j. M.,ef a/., [t98Sj Sfillii 69). The cloned gene for hGH has been expressed in a 
secreted form in ftcMichja g& (Chang, C. N., et al., [1987] £ej£ 55, 189) and its DNA and amino acid sequence has 

2 5 been reported (Goeddel, et ai., [1979] Nature 281. 544; Gray, ei a!., [1985] Gene 29., 247). The three-dimensional 

structure of hGH is not available. However, the three-dimensional folding pattern for porcine growth hormone 
(pGH) has been reported at moderate resolution and refinement (Abdel-Meguid, S. S., ef al., [1987] Proc. Natl. 
Acad. Sci. USA 84. 8434). Human growih hormone's receptor and antibody epitopes have beer, identified by 
homolog-scanning mutagenesis (Cunningham etal., Science 2J3j 1330, 1989). The structure of novel amino terminal 

3 0 methionyl bovine growth hormone containing a spliced-in sequence of human growth hormone including histidine 1 8 

and histidine 21 has been shown (U.S. Patent 4,880,910) 

Human growth hormone (hGH) causes a variety of physiological and metabolic effects in various animal 
models including linear bone growth, lactation, activation of macrophages, insulin-like and diabetogenic effects and 
others (R. K. Chawla ef ai., Ainu. Rev. Med. 34, 519 (1983); 0. G. P. Isaksson ef ai., Amu. Rev. Fhysioi. 47, 483 
35 (1985); C. K. Edwards etal., Science 239, 769 (1988); M. 0. Thorner and M. L. Vance, J. Clin. Invest. 82, 745 
(1988); J. P. Hughes and H. G. Friesen, Ann. Rev. Physiol. 47, 469 (1985)). These biological effects derive from 
the interaction between hGH and specific ceiiuiar receptors.. 

Accordingly, it is an object of this invention to provide a rapid and effective method for the systematic 
preparation of candidate binding substances. 
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it is another object of this invention io prepare candidate binding substances displayed on surface of a 
phagemid particle that are conformationally stable. 

!t is another object of this invention to prepare candidate binding substances comprising fusion proteins 
of a phage coat protein and a heterologous polypeptide where the polypeptide is greater than 100 amino acids in 
5 length and may be more than one subunit and is displayed on a phagemid panicle where the polypeptide is encoded 
by the phagemid genome. 

It is a further object of this invention to provide a method for the preparation and selection of binding 
substances that is sufficiently versatile to present, or display, all peptidyl moieties that could potentially 
participate in a noncovaleni binding interaction, and to present these moieties in a fashion that is sferically 
1 o confined. 

Still another object of the invention is the production of growth hormone variants that exhibit stronger 
affinity for growth hormone receptor and binding protein. 

It is yet another object of ibis invention to produce expression vector phagemids that contain a 
suppressible termination codon functionally located between the heterologous polypeptide and the phage coat 

1 5 protein such that detectable fusion protein is produced in a host suppressor cell and only the heterologous 
polypeptide is produced in a non-suppressor host ceil. 

Finally, it is an object of this invention to produce a phagemid particle that rarely displays more than one 
copy of candidate binding proteins on the outer surface of the phagemid particle so that efficient selection of high 
affinity binding proteins can be achieved. 

20 These and other objects of this invention will be apparent from consideration of the invention as a whole. 

SUMMARY OF THE INVENTION 
These objectives have been achieved by providing a meihod for selecting novel binding polypeptides 
comprising: (a) constructing a repticable expression vector comprising a first gene encoding a polypeptide, a 
second gene encoding at least a portion of a natural or wild-type phage coat protein wherein the first and second 

25 genes are heterologous, and a transcription reguiaiory element operabiy linked to the first and second genes, 

thereby forming a gene fusion encoding a fusion protein; (b) mutating the vector at one or more selected positions 
within the first gene thereby forming a family of related piasmids; (c) transforming suitable host cells with the 
plasmids; (d) infecting the transformed host ceils with a helper phage having a gene encoding the phage coat 
protein: (e) culturing the transformed infected host cells under conditions suitable for forming recombinant 

3 0 phagemid particles containing at least a portion of the plasmid and capable of transforming the host, the 

conditions adjusted so that no more than a minor amount of phagemid particles display more than one copy of the 
fusion, protein on the surface of the particle; (f) contacting the phagemid particles with a target molecule so that 
at least a portion of the phagemid particles bind to the target molecule; and (g) separating the phagemid 
particles that bind from those that do not Preferably, the method further comprises transforming suitable host 

35 cells with recombinant phagemid particles that bind to the target molecule and repeating steps (d) through (g) 
one or more times. 

Additionally, the method for selecting novel binding proteins where the proteins are composed of more 
than one subunit is achieved by selecting novel binding peptides comprising constructing a replicable expression 
vector comprising a transcription reguiaiory element operabiy linked to DNA encoding a protein of interest 
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containing one or more submits, wherein the DNA encoding at least one of the subunits is fused to the DNA 
encoding at least a portion of a phage coat protein:mutating the DNA encoding the protein of interest at one or 
more selected positions thereby forming a family of related vectors; transforming suitable host cells with the 
vectors; infecting the transformed host cells with a helper phage having a gene encoding the phage coat protein; 
culiurlng the transformed infected host cells under conditions suitable for forming recombinant phagemid 
particles containing at least a portion of the piasmid and capable cf transforming the host, the conditions adjusted 
so that no more than a minor amount of phagemid particles display more than one copy of the fusion protein on the 
surface of the particle; contacting the phagemid particles with a target molecule so that at least a portion of the 
phagemid particles bind to the target moiecuie; and separating the phagemid particles that bind from those that 
do not. 

Preferably in the method of this invention the piasmid is under tight control of the transcription 
regulatory element, and the culturing conditions are adjusted so that the amount or number of phagemid particles 
displaying more than one copy of the fusion protein on the surface of the particle is less than about 1 %. Also 
preferably, amount of phagemid particles displaying more than one copy of the fusion protein is less than 10% the 
amount of phagemid particles displaying a single copy of the fusion protein. Most preferably the amount is less 
than 20%. 

Typically, in the method of this invention, the expression vector will further contain a secretory signal 
sequences fused to the DNA encoding each subunit of the polypeptide, and the transcription regulatory element 
will be a promoter system. Preferred promoter systems are selected from; Lac Z, Xpi, TAC, T 7 polymerase, 
tryptophan, and alkaline phosphatase promoters and combinations thereof. 

Also typically, the first gene will encode a mammalian protein, preferably the protein will be selected 
from; human growth hormone(hGH), N-metrionyl human growth hormone, bovine growth hormone, parathyroid 
hormone, thyroxine, insulin A-chain, insulin B-chain, proinsulin, relaxin A-chain, relaxin B-chain, prorelaxin, 
glycoprotein hormones such as follicle stimulating hormoneiFSH), thyroid stimulating hormone(TSH), and 
leutinizing hormone(LH), glycoprotein hormone receptors, calcitonin, glucagon, factor VIII, an antibody, lung 
surfactant, urokinase, streptokinase, human tissue-type plasminogen activator (t-PA), bombesin, factor IX, 
thrombin, hemopoietic growth factor, tumor necrosis factor-alpha and -beta, enkephaiinase, human serum albumin, 
mullerian-inhibiting substance, mouse gonadotropin-associated peptide, a microbial protein, such as 
betaiactamase, tissue factor protein, inhibin, activin, vascular endothelial growth factor, receptors for hormones 
or growth factors; integrin, thrombopoietin, protein A or D, rheumatoid factors, nerve growth factors such as 
NQF-R platelet-growth factor, transforming growth factors (TGF) such as TGF-alpha and TGF-beta, insulin- 
like growth factor-i and -ii, insuiin-iike growth factor binding proteins , CD-4, DNase, latency associated peptide, 
erythropoietin, osteoinductive factors, interferons such as interferon-alpha, -beta, and -gamma, colony 
stimulating factors (GSFs) such as M-CSF, GM-CSF, and G-CSF, interleukins (ILs) such as IL-1, IL-2, IL-3, IL- 
4, superoxide dismuiase; decay accelerating factor, viral antigen, HIV envelope proteins such as GPI20, GP140, 
atrial natriuretic peptides A, B or C, immunoglobulins, and fragments of any of the above-listed proteins. 

Preferably the first gene will encode a polypeptide of one or more subunits containing more than about 
100 amino acid residues and will be folded to form a plurality of rigid secondary structures displaying a plurality 
of amino acids capable of interacting with the target. Preferably the first gene will be mutated at codons 



WO 92/09690 



6 



PCI7US9I/09133 



corresponding to only the amino acids capable of interacting with the target so that the integrity of the rigid 
secondary structures will be preserved. 

Normally, the method of this invention will employ a helper phage selected from; M13KO7.M13R408, 
M13-VCS, and Phi X 174. The preferred helper phage is M13K07, and the preferred coat protein is the M13 
5 Phage gene III coat protein. The preferred host is E coli, and protease deficient strains of E coli. Novel hGH 
variants selected by the method of the present invention nave been detected. Phagemid expression vectors were 
constructed that contain a suppressive termination codon functionally located between the nucleic acids encoding 
the polypeptide and the phage coat protein. 

BRIEF DESCRIPTION OF THE FIGURES 

1 0 FIGURE 1. Strategy for displaying large proteins on the surface of filamentous phage and enriching for 

altered receptor binding properties. A piasmid, phGH-M13glll was constructed that fuses the entire coding 
sequence of hGH to the carboxyl terminal domain of M13 gene III. Transcription of the fusion protein is under 
control of the lac promoter/operator sequence, and secretion is directed by the stll signal sequence. Phagemid 
particles are produced by infection with the "helper phage, M13K07, and particles displaying hGH can be 

1 5 enriched by binding to an affinity matrix containing the hGH receptor. The wild-type gene III (derived from the 
M13K07 phage) is diagramed by 4-5 copies of the multiple arrows on the tip of the phage, and the fusion protein 
(derived from the phagemid, phGH-M13glll) is indicated schematically by the folding diagram of hGH replacing 
the arrow head. 

FIGURE 2. Irnm jricbfoi of whole phage particles shows that hG! I ccrnigraies with phage. Pnagernid 

2 0 particles purified in a cesium chloride gradient were loaded into duplicate wells and electrophoresed through a 1 % 

agarose gel in 375 mM Tris, 40 mM glycine pH 9.6 buffer. The gel was soaked in transfer buffer (25 mM Tris, pH 
S.3. 200 mm glycine, 20% methanol) containing 2% SD3 and 2% p-rnercaptoethano! for 2 hours, then rinsed in 
transfer buffer for 6 hours. The proteins in the gel were then electroblotted onto immobilon membranes 
(Miliipore). The membrane containing one set of samples was stained with Coomassie blue to show the position of 
25 the phage proteins (A). The duplicate membrane was immune-stained for hGH by reacting the membrane with 
polyclonal rabbit antj-hGH antibodies followed by reaction with horseradish peroxidase conjugated goat anti- 
rabbit igG antibodies (B). Lane i contains the M13K07 parent phage and is visible only in the Coomassie blue 
stained membrane, since it lacks hGH. Lanes 2 and 3 contain separaie preparations of the hormone phagemid 
particles which is visible both by Coomassie and hGH immuno-staining. The difference in migration distance 

3 0 between the parent M13K07 phage and hormone phagemid particles reflects the different size genomes that are 

packaged within (S.7 kb vs. 5.1 kb, respectively). 

FIGURE 3. Summary diagram of steps in the selection process for an hGH-phage library randomized at 
codons 172, 174, 176, and 178. The template molecules, pH0415, containing a unique Kpn! restriction site and the 
hGH(R178G,ii79T) gene was muiagenized as described in the text and eiecirotransformed into E coli strain 
35 WJM101 to obtain the initial phagemid library, Library 1. An aliquot (approximately 2%) from Library 1 was used 
directly in an initial selection round as described in the text to yield Library 1G. Meanwhile, double-stranded DNA 
(dsDNA) was prepared from Library I, digested with restriction enzyme Kpnl to eliminate template background, 
and eiecirotransformed into WJM101 to yield Library 2. Subsequent rounds of selection (or Kpnl digestion, 
shaded boxes) followed by phagemid propagation were carried cut as indicated by the arrows, according to the 
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procedure described in the text. Four independent clones from Library 4G 4 and four independent clones from 
Library 5G 6 were sequenced by dideoxy sequencing. A!! of these clones had the identical D.NA sequence, 
corresponding to the hGH mutant (Glu 174 Ser, Phe 176 Tyr). 

FIGURE 4. Structural model of hGH derived from a 2.8 A folding diagram of porcine growth hormone 
5 determined crystaliographically. Location of residues in hGH that strongly modulate its binding to the hGH- 
binding protein are within the shaded circle. Alanine substitutions that cause a greater than tenfoid 
reduction^), a four- to tenfold reduction (•), or increase (O), or a two- to fourfold reduction (•), in binding 
affinity are indicated. Helical wheel projections in the regions of a-helix reveal their amphipathic quality. 
Blackened, shaded, or nonshaded residues are charged, polar, or nonpolar, respectively. In helix-4 the most 

1 0 important residues for mutation are on the hydrophilic face. 

RGURE 5. Amino acid substitutions at positions 172, 174, 176 and 178 of hGH (The notation, e.g. 
KSYR, denotes hGH mutant 1 72K/1 74S/1 76Y/1 78R.) found after sequencing a number of clones from rounds 1 
and 3 of the selection process for the pathways indicated (hGH elution; Glycine elution: or Glycine elutton after 
pre-adsorption). Non-functJonal sequences (i.e. vector background, or other prematurely terminated and/or 

1 5 frame-shifted mutants) are shown as "NP. Functional sequences which contained a non-silent, spurious mutation 
(i.e, out-side the set of target residues) are marked with a V. Protein sequences which appeared more than once 
among aii the sequenced dones, but with different DN A sequences, are marked with a "#". Protein sequences 
which appeared more than once among the sequenced clones and with the same DNA sequence are marked with a 
"" Note that after three rounds of selection, 2 different contaminating sequences were found: these clones did 

20 not correspond to cassette mutants, but to previously constructed hormone phage. The pS0643 contaminant 
corresponds to wild-type hGH-phage (hGH "KEFR"). The pH0457 contaminant, which dominates the third- 
round glycine-selected poo! of phage, corresponds to a wEviously identified mutant of hGH, "KSYR." The 
ampiificaiion oi ihese contaminants emphasizes uie ability of the homiorie phage selection process io seieci lor 
rarely occurring mutants. The convergence of sequences is also striking in all three pathways: R or K occurs most 

2 5 often at positions 1 72 and 1 78; Y or F occurs most often at position 1 76; and S, T, A and other residues occur at 
position 174. 

FIGURE 6. Sequences from phage selected on hPRLbp-beads in the presence of zinc. The notation is 
as described in Figure. 5. Here, the convergence of sequences is not predictable, but there appears to be a bias 
towards hydrophobic sequences under the mosi stringent (Glycine) seieciion conditions; L ,W and P residues are 
30 frequently found in this pool. 

FIGURE 7. Sequences from phage selected on hPRLbp-beads in the absence of zinc. The notation is as 
described in Figure 5. in contrast to the sequences of Figure. 6, these sequences appear more hydrophilic. After 
4 rounds of selection using hGH elution, two clones (ANHQ, and TLDT/1 71 V) dominate the pool. 

FIGURE S. Sequences from phage selected on blank beads. The notation is as described in Rg. 5. After 
35 three rounds of selection with glycine elution, no siblings were observed and a background level of non-functional 
sequences remained. 

FIGURE S. Construction of phagernid fi ori from pH0415. This vector for cassette mutagenesis and 
expression of the hGH-gene III fusion protein was constructed as follows. Piasmid pS0643 was constructed by 
oligonucleotide-directed mutagenesis of pS0132, which contains pBR322 and f1 origins of replication and 
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expresses an hGH-gene iii fusion protein (hGH residues MM. Stowed by a single Giy residue, fused to Pro-198 
of gene III) under the control of the f mlinhoA promoter. Mutagenesis was carried out with the oiigonucieoiide 
5 , -QQG-AQG-TGT-Q6G-TIC£rAG^GT-GGC-6GC-GGC-TCT-GGT-3 , I which introduced a Xbal Site 
(underlined) and an amber stop eodon (TAG) following Phe-1S1 of hGH. 
5 FIGURE 10. A. Diagram of plasmid pDH188 insert containing the DNA encoding the light chain and 

heavy chain (variable and constant domain 1 ) of the F a b humanized antibody directed to the HER-2 receptor. Vt_ 
and V H are the variable regions for the light and heavy chains, respectively. Gfc is the constant region of the human 
kappa light chain. CH1Q1 is the first constant region of the human gamma i chain. Both coding regions start with 
the bacteria! st !! signal sequence. B, A schematic diagram of the entire plasma pDH188 containing the insert 
10 described in 5A. After transformation of the plasmid into £. off SR101 cells and the addition of helper phage, 
the plasmid is packaged into phage particles. Some of these particles display the F a b-p I" fusion (where p iii is 
the protein encoded by the M13 gene 111 DNA). The segments in the plasmid figure correspond to the insert shown 
in5A. 

FIGURE 11 A through C are collectively referred to here as Figure 1 1. The nucleotide (Seq. ID No. 25) 
15 sequence of the DNA encoding the 4D5 F aD molecule expressed on the phagemid surface. The amino acid sequence 
of the light chain is also shown (Seq. ID No. 26), as is the amino acid sequence of the heavy chain p II! fusion (Seq. ID 
No. 27). 

FIGURE 12. Enrichment of wild-type 4D5 Fab phagemid from variant F aD phagemid. Mixtures of wild- 
type phagemid and variant 4D5 Fgu phagemid in a ratio of 1 :1,000 were selected on plates coated with the extra- 

20 cellular domain protein of the HER-2 receptor. After each round of selection, a portion of the eluted phagemid 
were infected into £ ca'/and plasmid DNA was prepared. This plasmid DNA was then digested with Eco RV and 
Pst I, separated on a 5% polyacryiamide gel, and stained with eihidiurn bromide. The bands were visualized under 
UV light. The bands due to the wild-type and variant pfasmids are marked with arrows. The first round of 
selection was eluted only under acid conditions; subsequent rounds were eluted with either an acid elut on (left 

25 side of Figure) or with a humanized 4D5 antibody wash step prior to acid elufion (right side of Figure) using 
methods described in Example VIII. Three variant 4D5 Fab molecules were made: H91 A (amino acid hisWine at 
position 81 on the V L chain mutated to alanine; indicated as 'A' lanes in Figure), Y49A (amino acid tyrosine at 
position 49 on the Vl chain mutated to alanine; indicated as B' lanes in the Figure), and Y92A (amino acid tyrosine 
at position 92 on the Vl chain mutated to alanine; indicated as 'C lanes in the Figure). Amino acid position 

30 numbering isaecording to ^baista!.,(Ssqa3ncesof proteins of immumhgicaU 4thed. i U.S. Deptof 
Health and Human Services, Public Health Service, Nad institute of Health, Beihesda, MD [1987]). 

FIGURE 13. The Scatchard analysis of the RIA affinity determination described in Experimental 
Protocols is shown here. The amount of labeled ECD anSgen that is bound is shown, on the x-axis while the amount 
that is bound divided by the amount that is free is shown on the y-axis. The siope of the line indicates the Ktf the 

35 calculated Kg is 1/Ka- 
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DETAILED DESCRIPTION OFTHE INVENTION 

The following discussion will be best understood by referring to Figure 1 . in its simplest form, the 
method of the instant invention comprises a meihod for selecting novel binding polypeptides, such as protein 
ligands. having a desired, usually high, affinity for a target molecule from a library of structurally related binding 
5 polypeptides. The library of structurally related polypeptides, fused to a phage coat protein, is produced by 
mutagenesis and, preferably, a single copy of each related polypeptide is displayed on the surface of a phagemid 
particle containing DNA encoding that polypeptide. These phagemid particles are then contacted with a target 
molecule and those particles having the highest affinity for the target are separated from those of lower affinity. 
The high affinity binders are then amplified by infection of a bacterial host and the competitive binding step is 
1 0 repeated. This process is reiterated until polypeptides of the desired affinity are obtained. 

The novel binding polypeptides or ligands produced by the method of this invention are useful per ss as 
diagnostics or therapeutics ( eg. agonists or antagonists) used in treatment of biological organisms. Structural 
analysis of the selected polypeptides may also be used to facilitate rational drug design. 

By 'binding polypeptide" as used herein is meant any polypeptide that binds with a selectable affinity to 
1 5 a target molecule. Preferably the polypeptide will be a protein that most preferably contains more than about 
100 amino acid residues. Typically the polypeptide will be a hormone or an antibody or a fragment thereof. 

By "high affinity" as used herein is meant an affinity constant {Kd ) of <10" 5 M and preferably <10" 7 M 
under physiological conditions. 

By "target molecule" as used herein is meant any molecule, not necessarily a protein, for which it is 
20 desirable to produce a iigand. Preferably, however, the target will be a protein and most preferably the target 
will be a receptor, such as a hormone receptor. 

By 'humanized antibody" as used herein is meant an antibody in which the complementarity-determining 
regions (CDRs) of a mouse or other non-human antibody are grafted onto a human antibody framework. By human 
antibody framework is meant the entire human antibody excluding the CDRs. 
25 L choice of Pofvoentides for Display on the Surface of a Phage 

The first step in the method of this invention is to choose a polypeptide having rigid secondary 
structure exposed to the surface of the polypeptide for display on the surface of a phage. 

By "polypeptide" as used herein is meant any molecule whose expression can be directed by a specific 
DNA sequence. The polypeptides of this invention may comprise more than one subunit, where each subunit is 
3 0 encoded by a separate DNA sequence. 

By "rigid secondary structure" as used herein is meant any polypeptide segment exhibiting a regular 
repeated structure such as is found in; a-helices, 3io helices, Tt-nelices, parallel and antiparallel (J-sheets, and 
reverse turns. Certain "non-ordered" structures that lack recognizable geometric order are also included in the 
definition of rigid secondary structure provided they form a domain or "patch" of amino acid residues capable of 
3 5 interaction with a target and that the overall shape of the structure is not destroyed by replacement of an amino 
acid within the structure . It is believed thai some non-ordered structures are combinations of reverse turns. The 
geometry of these rigid secondary structures is well defined by <f and y torsional angles about the a-carbons of 
the peptide "backbone". 
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The requirement that the secondary structure be exposed to the surface of the polypeptide is to 
provide a domain or "patch" of amino acid residues that can be exposed to and bind with a target molecule. It is 
primarily these amino acid residues that are replaced by mutagenesis that form the "library- of structurally 
related (mutant) binding polypeptides thai are displayed on the surface of the phage and from which novel 
5 polypeptide ligands are selected. Mutagenesis or replacement of amino acid residues directed toward the interior 
of the polypeptide is generally avoided so that the overall structure of the rigid secondary structure is preserved. 
Some replacement of amino acids on the interior region of the rigid secondary structures, especially with 
hydrophobic amino acid residues, may be tolerated since ihese conservative substitutions are unlikely to distort 
the overall structure of the polypeptide. 

1 o Repeated cycles of "polypeptide" selection are used to select for higher and higher affinity binding by 

the phagemid selection of multiple amino acid changes which are selected by multiple selection cycles. Following a 
first round of phagemid selection, involving a first region or selection of amino acids in the ligand polypeptide, 
additional rounds of phagemid selection in other regions or amino acids of the ligand polypeptide are conducted. 
The cycles of phagemid selection are repeated until the desired affinity properties of the ligand polypeptide are 

1 5 achieved. To illustrate this process, Example VII! phagemid selection of hQH was conducted in cycles. In the first 
cyde hGH amino acids 172, 174, 1 76 and 178 were mutated and phagemid selected, in a second cycle hGH amino 
acids 167, 171, 175 and 179 were phagemid selected. In a third cycle hGH amino acids 10, 14, 18 and 21 were 
phagemid selected. Optimum amino acid changes from a previous cycle may be incorporated into the polypeptide 
before the next cycle of selection. For example, hGH amino acids substitution 1 74 (serine) and 1 76 (tyrosine) 

20 were incorporated into the hGH before the phagemid selection of hGH amino acids 167, 171 , 175and179. 

From the forgoing it will be appreciated that the amino add residues that form the binding domain of 
the polypeptide will not be sequentially linked and may reside on different subuniis of the polypeptide. Thai is, 
the binding domain tracks with the particular secondary structure at the binding site and not the primary 
structure. Thus, generally, mutations will be introduced into cottons encoding amino acids within a particular 

25 secondary structure at sites directed away from the interior of the polypeptide so ihai ihey will have the 

potential to interact with the target. By way of iHustratJon. Figure 2 shows the location of residues in hGH that 
are known to strongly modulate its binding to the hGH-binding protein (Cunningham etal., Sjaerjci 247:1461- 
1465 [1990]). Thus representative sites suitable for mutagenesis wouid include residues 172, 174, 176, and 178 
on he!ix-4, as well as residue 64 located in a "non-ordered" secondary structure. 

30 There is no requirement thai the polypeptide chosen as a ligand to a target normally bind to that target. 

Thus, for example, a glycoprotein hormone such as TSH can be chosen as a ligand for ttie FSH receptor and a 
library of mutant TSH molecules are employed in the method of this invention to produce novel drug candidates. 

mis invention thus contemplates any polypeptide that binds to a target molecule, and includes 
antibodies. Preferred polypeptides are those that have pharmaceutical utility. More preferred polypeptides 

35 include; a growm hormone, including human growth hormone, des-N-methionyl human growth hormone, and bovine 
growth hormone; parathyroid hormone; thyroid stimulating hormone; thyroxine; insulin A-chain; insulin B-chain; 
proinsulin; follicle stimulating hormone; calcitonin; leutinizing hormone; glucagon; factor VIII; an antibody; lung 
surfactant; a plasminogen act'vator, such, as urokinase or human tissue-type plasminogen activator ft-PA): 
bombesin; factor IX, thrombin; hemopoietic growth factor; tumor necrosis factor-alpha and -beta;enkephalinase; a 
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serum albumin such as human serum albumin; mUteriamnhibiting substance; relaxin A-chain; relaxin B-chain; 
prorslaxin; mouse gonadotropin associated peptide; a microbial protein, such as betaiactamase; tissue factor 
protein; inhibin; activin; vascular endothelial growth factor; receptors for hormones or growth factors; integrin; 
thrombopoietin; protein A or D; rheumatoid factors; nerve growth factor such as NGF-B; platelet-derived growth 
5 factor; fibroblast growth factor such as aFGF and bFGF; epidermal growth factor; transforming growth factor 
(TGF) such as TGF-alpha and TGF-beta; insulin-iike growth factor-i and -ii; insulin-iike growth factor binding 
proteins; CD-4; DNase; latency associated peptide; erythropoietin; osteoinductive factors; an interferon such as 
inierferon-aipha, -beta, and -gamma; colony stimulating factors (CSFs), e.g., M-CSF, GM-CSF, and G-CSF; 
interleukins (ILs), e.g., IL-1, IL-2, IL-3, IL-4, etc.; superoxide dismutase; decay accelerating factor; atrial 

1 o natriuretic peptides A, B or C: viral antigen such as, for example, a portion of the HIV envelope; immunoglobulins; 

and fragments of any of the above-iisted polypeptides. In addition, one or more predetermined amino acid 
residues on the polypeptide may be substituted, inserted, or deleted, for example, to produce products with 
improved biological properties. Further, fragments of these polypeptides, especially biologically active 
fragments, are included. Yet more preferred polypeptides of this invention are human growth hormone , and 
1 5 atrial naturetic peptides A, B, and C, endotoxin, subiisin, trypsin and other serine proteases. 

SSI! more preferred are polypeptide hormones mat can be defined as any amino acid sequence produced 
in a first ceii thai binds specificaiiy to a receptor on the same cell type (autocrine hormones) or a second cell type 
(non-autocrine) and causes a physiological response characteristic of the receptor-bearing cell. Among such 
polypeptide hormones are cytokines, tymphokines, neurotrophic hormones and adenohypophyseal polypeptide 

2 0 hormones such as growth hormone, proiaciin, piaceriiai iacfegen, luteinizing hormone, foiiicie-sumuiaurig hormone, 

thyrotropin, chorionic gonadotropin, corticotropin, a or B-melanocyte-stimulating hormone, pNipotropin, y- 
lipotropin and the endorphins; hypothabnic release-inhibiting hormones such as cortieotropin-release factor, 
growth hormone release-inhibiting hormone, growth hontione-reiease factor; and oiher polypeptide hormones such 
as atrial natriuretic peptides A, B or C. 

The gene encoding the desired polypeptide (i.e., a poiypepiide wiih a rigid secondary struciure) can be 
obtained by methods known in the art (see generally, Sambrook ei al. , Molecular Biology: A Laboratory Manual . 
Cold Spring Harbor Press, Cold Spring Harbor, New York [1989]). If the sequence of the gene is known, the 
DNA encoding the gene may be chemically synthesized (Merrfiefci, J. Am. Chem. Soc... 85 2149 [1963]). if the 

3 0 sequence of the gene is not known, or if the gene has not previously been isolated, it may be cloned from a cDNA 

library (made from RNA obtained from a suitable tissue in which the desired gene is expressed) or from a suitable 
genomic DNA library. The gene is then isolated using an appropriate probe. For cDNA iibraries, suitable probes 
include monoclonal or polyclonal antibodies (provided that the cDNA library is an expression library), 
oligonucleotides, and complementary or homologous cDNAs or fragments thereof. The probes that may be used to 
3 5 isolate the gene of interest from genomic DNA libraries include cDNAs or fragments thereof that encode the same 
or a similar gene, homologous genomic DNAs or DNA fragments, and oligonucleotides. Screening the cDNA or 
genomic library with the selected probe is conducted using standard procedures as described in chapters 10-12 
of Sambrook era/., supra. 
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An alternative means to isolating the gene emoting the protein of interest is to use polymerase chain 
reaction methodology (PCR) as described in section 14 of Sambrook etal., supra. This method requires the use 
of oligonucleotides that will hybridize to the gene of interest; thus, at least some of the DNA sequence for this 
gene must be known in order to generate the oligonucleotides. 
5 After the gene has been isolated, it may be inserted into a suitable vector (preferably a piasmid) for 

amplification, as described generally in Sambrook etal., supra. 

While several types of vectors are available and may be used to practice this invention, piasmid vectors 
are the preferred vectors for use herein, as they may be constructed with relative ease, and can be readily 
1 o amplified. Piasmid vectors generaiiy contain a variety of components including promoters, signal sequences, 
phenotypic selection genes, origin of replication sites, and other necessary components as are known to those of 
ordinary ski!! in the art. 

Promoters most commoniy used in prokaryofic vectors include the Jac Z promoter system, the alkaline 
phosphatase pte A promoter, the bacteriophage XPL promoter (a temperature sensitive promoter), the lac 

1 5 promoter {a hybrid £f±-lac promoter that is regulated by the Jac repressor), the tryptophan promoter, and the 
bacteriophage T7 promoter. For general descriptions of promoters, see section 17 of Sambrook etal supra . 
While these are the most commonly used promoters, other suitable microbial promoters may be used as well. 

Preferred promoters for practicing this invention are those that can be tightly regulated such that 
expression of the fusion gene can be coniroiied. it is beiieved ihat the problem that went unrecognized :n the 

20 prior art was that display of multiple copies of the fusion protein on the surface of the phagemid particle lead to 
multipoint attachment of the phagemid with the target It is beloved this effect, referred to as the "chelate 
effect", results in selection of false 'high affinity' polypeptides when multiple copies of the fusion protein are 
displayed on the phagemid particle in close proximity to one another so that the target was 'chelated'. When 
multipoint attachment occurs, the effective or apparent Kd may be as high as the product of the individual Kds 

25 for each copy of the displayed fusion protein, mis effect may be ihe reason Cwiria and coworkers supra were 
unable to separate moderate affinity peptides from higher affinity peptides. 

it has been discovered that by tightly regulating expression of the fusion protein so that no more than a 
minor amount, i.e. fewer than about 1%, of the phagemid particles contain multiple copies of me fusion protein the 
■chelate effect" is overcome allowing proper selection of high affinity polypeptides. Thus, depending on the 

30 promoter, culiuring conditions of the host are adjusted to maximize the number of phagemid particles containing a 
single copy of the fusion protein and minimize the number of phagemid particles containing multiple copies of the 
fusion protein. 

Preferred promoters used to practice this invention are the lac Z promoter and the nho. A promoter. 
The lac Z promoter is regulated by the lac repressor protein lac i, and thus transcription of the fusion gene can be 
35 controlled by manipulation of the level of the lac repressor protein. By way of illustration, the phagemid containing 
the jac Z promoter is grown in a cell strain ihat contains a copy of the jac i repressor gene, a repressor for the 
lac Z promoter. Exemplary cell strains containing the jaci gene include JM 101 and XL1 -blue. In the alternative, 
the host cell can be cotransfected with a piasmid containing both the repressor Jac i and the lac Z promoter. 
Occasionally both of the above techniques are used simultaneously, that is, phagmide particles containing the lac Z 
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promoter are grown in cell strains containing the Jac i gene and the cell strains are cotransfected with a plasmid 
containing both the Jac Z and lac i genes. Normally when one wishes to express a gene, to the transfected host 
above one would add an inducer such as iscpropyithiogaiactoside (IPTG). in the present invention however, this 
step is omitted to (a) minimize the expression of the gene III fusion protein thereby minimizing the copy number 
5 (i.e. the number of gene !!! fusions per phagemid number) and to <b) prevent poor or improper packaging of the 
phagemid caused by inducers such as IPTG even at low concentrations. Typically, when no inducer is added, the 
number of fusion proteins per phagemid particle is about 0.1 (number of bulk fusion proteins/number of phagemid 
particles). The most preferred promoter used to practice this invention is si& A. This promoter is believed to be 
regulated by the level of inorganic phosphate in the cell where the phosphate acts to down-regulate the activity of 

1 0 the promoter. Thus, by depleting cells of phosphate, the activity of the promoter can be increased. The desired 
result is achieved by growing cells in a phosphate enriched medium such as 2YT or LB thereby controlling the 
expression of the gene III fusion. 

One other useful component of vectors used to practice this invention is a signal sequence. This sequence 
is typically located immediately 5' to the gene encoding the fusion protein, and will thus be transcribed at the amino 

1 5 terminus of the fusion protein. However, in certain cases, the signal sequence has been demonstrated to be located 
at positions other 5" to the gene encoding the protein to be secreted. This sequence targets the protein to which 
it is attached across the inner membrane of Die bacterial ceil. The DNA encoding ttne signal sequence may be 
obtained as a restriction endonuclease fragment from any gene encoding a protein that has a signal sequence. 
Suitable prokaryotic signal sequences may be obtained from genes encoding, for example, LamB or OropF (Wong 

20 eiai.. Gene . 53:193 [1983]), MalE, FhoA and other genes. A preferred prokaryotic signal sequence for practicing 
this invention is the E coli heat-stable enterotoxin II (STIl) signal sequence as described by Chang etal. , fisns, 
55: 189 [1987]. 

Anoiher usefui component of the vectors used to practice this invention is phenoiypic selection genes. 
Typical phenotypic selection genes are those encoding proteins that confer antibiotic resistance upon the host cell. 
25 By way of illustration, the ampicillin resistance gene (amp) , and the tetracycline resistance gene (ret) are readily 
employed for ihis purpose. 

Construction of suitable vectors comprising the aforementioned components as well as the gene encoding 
the desired polypeptide (gene 1) are prepared using standard recombinant DNA procedures as described in 
Sambrook era/, supra, isolated DNA fragments to be combined to form ihe vector are cieaved, iaiiored, and 
30 ligated together in a specific order and orientation to generate the desired vector. 

The DNA is cleaved using the appropriate restriction enzyme or enzymes in a suitable buffer. In general, 
about 0.2-1 p.g oi plasmid or DNA fragments is used with about 1-2 units of the appropriate restriction enzyme 
in about 20 u.1 of buffer solution. Appropriate buffers, DNA concentrations, and incubation times and 
temperatures are specified by the manufacturers of the restriction enzymes. Generally, incubation times of about 
35 one or two hours at 37'C are adequate, although several enzymes require higher temperatures. After incubation, 
the enzymes and other contaminants are removed by extraction of the digestion solution with a mixture of phenol 
and chloroform, and the DNA is recovered from the aqueous fraction by precipitation with ethane!. 

To ligate the DNA fragments together to form a functional vector, the ends of the DNA fragments 
must be compatible with each other. In some cases, ihe ends will be directly compatible after endonuclease 
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digestion. However, it maybe necessary to first convert Sie sticky ends commonly produced by endonuclease 
digestion to blunt ends to make them compatible for ligation. To blunt the ends, the DNA is treated in a suitable 
buffer for at least 15 minutes at 15*0 with 10 units of of the Wenow fragment of DNA polymerase I (Klenow) in 
the presence of the four deoxynucieoiide triphosphates. The DNA is then purified by phenol-chloroform 
5 extraction and ethanol precipitation. 

The cleaved DNA fragments may be size-separated and selected using DNA gel electrophoresis. The 
DNA may be eiectrophoresed through either an agarose or a polyacr/iamide matrix. The selection of the matrix 
will depend on the size of the DNA fragments to be separated. After electrophoresis, the DNA is extracted from 
the matrix by electrocution, or, if low-melting agarose has been used as the matrix, by melting the agarose and 

1 0 extracting the DNA from it, as described in sections 6.30-8.33 of Sambrook et supra. 

The DNA fragments that are to be ligated together (previously digested with the appropriate 
restriction enzymes such, that the ends of each fragment to be ligated are compatible) are put in solution in about 
equimoiar amounts. The solution will also contain ATP, ligase buffer and a ligase such as T4 DNA ligase at about 
10 units per 0.5 u.g of DNA. If the DNA fragment is to be ligated into a vector, the vector is at first linearized by 

1 5 cutting with the appropriate restriction endonuc!ease(s). The linearized vector is then treated with alkaline 
phosphatase or calf intestinal phosphatase. The phosphaiasing prevents seiHigatien of the vector during the 
ligation step. 

After ligation, the vector with the foreign gene now inserted is transformed into a suitable host cell. 
Prokaryotes are the preferred host ceiis for this invention. Suitable prokaryotic host cells include £ coli strain. 

20 JM101, £ coli K12 strain 294 (ATCC number 31,446), £ coB strain W3110 (ATCC number 27,325), £ coll 
X1776(ATCC number 31, 537). E.co!i XL-1B!ue(stratagene) ; and E.coli B: however many other strains of £ 
coli, such as HB101 , NMS2Z, NM538, NM539, and many other species and genera of prokaryotes may be used as 
well. In addition to the £ coli strains listed above, bacilli such as Bacillus SUbtiliS, other enterobacteriaceae such as 
ffalmnnrilatvnhimii^ <* senafia mamesans. and various Pseudomonas species may all be used as hosts. 

25 Transformation of prokaryotic ceils is readily accomplished using the calcium chloride method as 

described in section 1.82 of Sambrook ef a/., supra. Alternatively, electror^^ 
1:641 [1982]) may be used to transform these cells. The transformed cells are selected by growth on an 
antibiotic, commonly tetracycline (tet) or ampiciiiin (amp), to which they are rendered resistant due to the 
presence of tet and/or amp resistance genes on the vector. 

3 o After selection of the transformed cells, these cells are grown, in culture and the piasmid DNA (or other 

vector with the foreign gene inserted) is then isolated. Piasmid DNA can be isolated using methods known in the 
art Two suitable methods are the small scale preparation of DNA and the large-scale preparation of DNA as 
described in sections 1.25-1 .33 of Sambrook ef si., supra. The isolated DNA can be purified by methods known in 
the art such as that described in section 1 .40 of Sambrook et a/., supra. This purified piasmid DNA is ihen 

■35 analyzed by restriction mapping and/or DNA sequencing. DNA sequencing is generally performed by either the 
method of Messing era/. Njjcjgjfi Adds Res.. 9309 M9S11 or by the method of Maxam et a!. Msth, BgyffWl,, 65: 
499 [1980]. 
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IV. ftcnaRrinn 

This invention contemplates fusing the gene enclosing the desired polypeptide (gene 1 ) to a second gene 
(gene 2) such that a fusion protein is generated during transcription. Gene 2 is typically a coat protein gene of a 
phage, and preferably it is the phage M13 gene III coat protein, or a fragment thereof. Fusion of genes 1 and 2 may 
5 be accomplished by inserting gene 2 into a particular site on a piasmid that contains gene 1 , or by inserting gene 1 
into a particular site on a piasmid that contains gene 2. 

Insertion of a gene into a piasmid requires that the piasmid be cut at the precise location that the gene is 
to be inserted. Thus, there must be a restriction endonuclease site at this location (preferably a unique site such 
that the piasmid will only be cut at a single location during restriction endonuclease digestion). The piasmid is 

1 0 digested, phosphatased, and purified as described above. The gene is then inserted into this linearized piasmid by 
iigating the two DMAs together. Ligation can be accomplished if the ends of the piasmid are compatible with the 
ends of the gene to be inserted. If the restriction enzymes are used to cut the piasmid and isolate the gene to be 
inserted create blunt ends or compatible sticky ends, the DNAs can be ligated together directly using a ligase 
such as bacteriophage T4 DNA iigase and incubating the mixture at 1oC for 1-4 hours in the presence of ATP 

1 5 and ligase buffer as described in section 1 .68 of Sambrook ef al., supra . If the ends are not compatible, they must 
first be made blunt by using the Klenow fragment of DNA polymerase I or bacteriophage T4 DNA polymerase, 
both of which require the four deoxyribonudeoiide iriphosphaies to fiii-in overhanging single-stranded ends of 
the digested DNA. Alternatively, the ends may be blunted using a nuclease such as nuclease S1 or mung-bean 
nuclease, both of which function by cutting back the overhanging single strands of DNA. The DNA is then 

20 reiigated using a iigase as described above, in some cases, it may not be possible to biunt ine ends of the gene to 
be inserted, as the reading frame of the coding region will be altered. To overcome this problem, oligonucleotide 
linkers may be used The linkers serve as a bridge to connect the piasmid to the gene to be inserted. These linkers 
can be made syrriheiicaiiy as double stranded or single siranded DNA using standard methods. The linkers have 
one end that is compatible with the ends of the gene to be inserted; the linkers are first ligated to this gene using 

25 ligation methods described above. The other end of the linkers is designed to be compatible with the piasmid for 
iigation. in designing the iinkers, care must be taken to noi destroy the reading frame of the gene to be inserted 
or the reading frame of the gene contained on the piasmid. In some cases, it may be necessary to design the linkers 
such that they code for part of an amino acid, or such that they code for one or more amino acids. 

Between gene 1 and gene 2, DNA encoding a termination codon may be inserted, such termination codons 

30 are UAG( amber), UAA (ocher) and UGA (opel). (Microbiology, Davis et al. Harper & Row, New York, 1980, 

pages 237, 245-47 and 274). The termination codon expressed in a wild type host cell results in the synthesis of 
the gene 1 protein product without the gene 2 protein attached. However, growth in a suppressor host cell 
results in the synthesis of detectable quantities of fused protein. Such suppressor host cells contain a tRNA 
modified to insert an amino acid in the termination codon position of the mRNA thereby resulting in production of 

3 5 detecfJble amounts of the fusion protein. Such suppressor host cells are well known and described, such as E.coii 
suppressor strain (Bullock et al., BioTechnioues 5, 376-379 [1987]). Any acceptable method may be used to 
place such a termination codon into ihe mRNA encoding the fusion polypeptide. 

The suppressible codon may be inserted between the first gene encoding a polypeptide, and a second 
gene encoding at least a portion of a phage coat protein. Alternatively, the suppressible termination codon may be 
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inserted adjacent to the fusion site by replacing the iast amino acid iriptet in the polypeptide or the first amino 
acid in the phage coat protein. When the phagemid containing the suppressive codon is grown in a suppressor 
host ce!!, it results in the detectable production of a fusion polypeptide containing the polypeptide and the coat 
protein. When the phagemid is grown in a non-suppressor host ceil, the polypeptide is synthesized substantially 
5 without fusion to the phage coat protein due to termination at the inserted suppressive triplet encoding UAG, 
UAA, or UGA. In the non-suppressor cell the polypeptide is synthesized and secreted from the host celi due to the 
absence of the fused phage coat protein which otherwise anchored it to the host ceil. 
V. Alteratianrmutfltlp n) rrf Gene 1 at Selected Positions 

Gene 1 , encoding She desired polypeptide, may be altered at one or more selected codons. An alteration 

1 o is defined as a substitution, deletion, or insertion of one or more codons in the gene encoding the polypeptide that 
results in a change in the amino acid sequence of the polypeptide as compared with the unaltered ornative 
sequence of the same polypeptide. Preferably, the alterations will be by substitution of at least one amino acid 
with any other amino acid in one or more regions of the molecule. The alterations may be produced be a variety of 
methods known in the art. These methods include but are not limited to oligonucleotide-mediated mutagenesis and 

1 5 cassette mutagenesis. 

Oligonucleotide -mediated mutagenesis is preferred method for preparing substitution, deletion, and 
insertion variants of gene 1 . This technique is we!! known in the art as described by Zoiier et aL Nucleic Acids Res. 
JQ: 6487-6504 [1987]. Briefly, gene 1 is altered by hybridizing an oligonucleotide encoding the desired mutation 
20 to a DNA template, where the template is the single-stranded form of the plasmid containing the unaltered or 
native DNA sequence of gene 1 . After hybridization, a DNA polymerase is used to synthesize an entire second 
complementary strand of the template wiii thus incorporate the oligonucleotide primer, and will code for the 
selected alteration in gene 1. 

Generally, oligonucleotides of at least 25 nucleotides in length are used. An optima! oligonucleotide will 
25 have 12 to 15 nucleotides that are completely complementary to the template on either side of the nucieoiide(s) 
coding for the mutation. This ensures that the oligonucleotide will hybridize properly to the single-stranded DNA 
template moiecuie. The oligonucleotides are readily synthesized using techniques known in the art such as that 
described by Crea er al. Prnr Natl. Acad. Sci. USA. 75: 5765 [1978). 

The DNA template can only be generated by those vectors that are either derived from bacteriophage 
30 MT3 vectors (the commercially available Ml3mp18 and M13mp19 vectors are suitable}, or those vectors that 
contain a single-stranded phage origin of replication as described by Viera etal. Meth. Enzymol,, 153:3 [1987]. 
Thus, the DNA that is to be mutated must be inserted into one of these vectors in order to generate single- 
stranded template. Production ofihe single-stranded template is described in sections 4.21-4.41 of Sambrook 
etal., supra. 

35 To alter the native DNA sequence, the oligonucleotide is hybridized to the single stranded template 

under suitable hybridization conditions. A DNA polymerizing enzyme, usually the Klenow fragment of DNA 
polymerase I, is then added to synthesize the complementary strand of the template using the oligonucleotide as a 
pimer for synthesis. A heteroduplex molecule is thus formed such that one strand of DNA encodes the mutated 
form of gene 1 , and the other strand (the original template) encodes the native, unaltered sequence of gene 1 . 
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This heteroduplex molecule is then transformed into a suitable host cell, usually a prokaryote such as E Coli 
JM101 . After growing the cells, they are plated onto agarose plates and screened using the oligonucleotide primer 
radlolabelled with 32-Phosphate to identify the bacterial colonies that contain the mutated DNA. 

The method described immediately above may be modified such that a homodupiex molecule is created 
5 wherein both strands of the plasmid contain the mutaSon(s). The modifications are as fellows: The single- 
stranded oligonucleotide is anneaied to the single-stranded template as described above. A mixture of three 
deoxyribonudeotides, deoxyriboadenosine (dATP), deoxyriboguanosine (dGTP), and deoxyribothymidine (dTTP), 
is combined with a modified thio-deoxyribocytosine called dCTP-{aS) (which can be obtained from Amersham). 
This mixture is added to the template-oligonucleotide complex. Upon addition of DNA polymerase to this mixture, 

10 a strand of DNA identical to the template except for the mutated bases is generated. In addition, this new strand 
of DNA will contain dCTP-(aS) instead of dCTP, which serves to protect it from restriction endonuclease 
digestion. After the template strand of the double-stranded heteroduplex is nicked with an appropriate 
restriction enzyme, the template strand can be digested with Exolll nuclease or another appropriate nuclease past 
the region that contains the site(s) to be mutagenized. The reaction is then stopped to leave a molecule thai is 

1 5 only partially single-stranded. A complete double-stranded DNA homodupiex is then formed using DNA 

polymerase in the presence of all four deoxyribonucleotide triphosphates, ATP, and DNA ligase. This homodupiex 
molecule can then be transformed into a suitable host cell such as E coli JM101, as described above. 

Mutants with more than one amino acid to be substituted may be generated in one of several ways. If the 
amino acids are located close together in the polypeptide chain, they may be mutated simultaneously using one 

20 oligonucleotide thai codes for ail of the desired amino acid substitutions. If, however, the amino acids are located 
some distance from each other (separated by more than about ten amino acids), it is more difficult to generate a 
single oligonucleotide that encodes all of the desired changes. Instead, one of two alternative methods may be 
employed. 

in the first method, a separate oligonucleotide is generated for each amino acid to be substituted. The 

2 5 oligonucleotides are then annealed to the single-stranded template DNA simultaneously, and the second strand of 

DNA that is synthesized from the template will encode all of the desired amino acid substitutions. The alternative 
method involves two or more rounds of mutagenesis to produce the desired mutant. The first round is as 
described for the single mutants: wild-type DNA is used for the template, an oligonucleotide encoding the first 
desired amino acid substitutions) is anneaied to this iempiate, and the heteroduplex DNA moiecuie is then 

3 0 generated. The second round of mutagenesis utilizes the mutated DNA produced in the first round of 

mutagenesis as the template. Thus, this template already contains one or more mutations. The oligonucleotide 
encoding the additional desired amino acid substitution(s) is then annealed to this template, and the resulting 
strand of DNA now encodes mutations from both the first and second rounds of mutagenesis. This resultant DNA 
can be used as a template in a third round of mutagenesis, and so on. 

This method is also a preferred method for preparing substitution, deletion, and insertion variants of 
gene 1. The method is based on that described by Weils elai. Gene. 34:315 [1885].. The starting material is the 
plasmid (or other vector) comprising gene 1 , the gene to be mutated. The codon(s) in gene 1 to be mutated are 
identified. There must be a unique restriction endonuclease site on each side of the identified mutation site(s). If 
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no such resiriciion sites exist, they may be generated using the above-described oligonudeotide-mediated 
mutagenesis method to introduce them at appropriate legations in gene 1. After the resiriciion sites nave been 
introduced into the plasmid, the plasmid is cut at these sites to linearize it. A doubie-stranded oligonucleotide 
encoding the sequence of the DNA between the restriction sites but containing the desired mutat" on(s) is 
5 synthesized using standard procedures. The two strands are synthesized separately arw then hybridized 
together using standard techniques. This double-stranded oligonucleotide is referred to as the cassette. This 
cassette is designed to have 3' and 5' ends that sss compatible with the ends of the linearized plasmid, such that it 
can be directly ligated to the plasmid. This plasmid now contains the mutated DNA sequence of gene i . 

10 in an alternative embodiment, this invention contemplates production of variants of a desired protein 

containing one or more submits. Each suburit is typk^ly encoded by separate gene. Each gene encoding each 
subunit can be obtained by methods known in the art (see, for example. Section II). In some instances, it may be 
necessary ioobtain the gene encoding the various subunits using separate techniques selected from any of the 
methods described in Section II. 

1 5 When constructing a replicable expression vector where the protein of interest contains more than one 

subunit, ail subunits can be regulated by the same promoter, typically located 5' to the DNA encoding the subunits, 
or each may be regulated by separate promoter suitably oriented in the vector so that each promoter is operabiy 
!inked te {he DNA it is intended to regulate . Selection of promoters is carried out as described in Section III 
above. 

20 In constructing a replicable expression vector containing DNA encoding the protein of interest having 

multiple subunits, the reader is referred to Figure 10 where, by way of illustration, a vector is diagrammed 
showing DNA encoding each subunit of an antibody fragment This figure shows that, generally, one of the 
subunits of the protein of interest will be fused to a phage coat protein such as M13 gene III. This gene fusion 
generally wii! contain its own signal sequence. A separate gene encodes the other subunit or subunits, and it is 

25 apparent that each subunit generally has its own signal sequence. Figure 10 also shows that a single promoter can 
regulate the expression of both subunits. Alternatively, each subunit may be independently regulated by a 
different promoter. The protein of interest subunit-phage coat protein fusion construct can be made as 
described in Section IV above. 

When constructing a family of variants of the desired multj-subunit protein, DNA encoding each subunit 

30 in the vector may mutated in one or more positions in each subunit. When multi-subunit antibody variants are 
constructed, preferred sites of mutagenesis correspond to codons encoding amino acid residues located in the 
comrtementarity-determining regions (CDR) of either the light chain, the heavy chain, or both chains. The CDRs 
are commonly referred to as the hypervariabie regions. Methods for mutagenizing DNA encoding each subunit of 
the protein of interest are conducted essentially as described in Section V above. 

35 

VII. Preparing a Targe t Ma'ssife aid Binding with Phaaeffikj 

Target proteins, such as receptors, may be isolated from natural sources or prepared by recombinant 
methods by procedures known in the art. Byway of illustration, glycoprotein hormone receptors may be prepared 
by the technique described by McFarfand el a/., Seisms 245:494433 [13831, nonglycosylated forms expressed 
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in £ co// are described by Fun et at. J. Biol. Chem 265:31 1 1-31 15 [1990] Other receptors can be prepared by 
standard methods. 

The purified target protein may be attached to a suitable matrix such as agarose beads, acryiamide 
beads, glass beads, cellulose, various acrylic copolymers, hydroxylalkyl methacrylate gels, polyacryfic and 
5 polymethacrylic copolymers, nylon, neutral and ionic carriers, and the like. Attachment of the target protein to the 
matrix may be accomplished by methods described In MetlPflSin Enzymoiogy, 44 [1976], or by other means known 
in the art. 

After attachment of the target protein to the matrix, the immobilized target is contacted with the 
library of phagemid particles under conditions suitable for binding of at least a portion of the phagemid particles 
1 0 with the immobilized target. Normally, the conditions, including pH, ionic strength, temperature and the like will 
mimic physiological conditions. 

Bound phagemid particles ('binders') having high affinity for the immobilized target are separated 
from those having a low affinity (and thus do not bind to the target) by washing. Binders may be dissociated 
from the immobilized target by a variety of methods. These methods include competitive dissociation using the 
1 5 wild-type ligand, altering pH and/or ionic strength, and methods known in the art. 

Suitable host cells are infected with the binders and helper phage, and the host cells are cultured under 
conditions suitable for amplification of the pnagernid particles. The phagemid particles are then collected and the 
selection process is repeated one or more times until binders having the desired affinity for the target molecule 

20 Optionally the library of pnagernid particles may be sequentially contacted with more than one 

immobilized target to improve selectivity for a particular target. For example, it is often the case that a ligand 
such as hGH has more than one natural receptor. In the case of hGH. both the growth hormone receptor and the 
prolactin receptor bind the hGH ikjand. It may be desirable to improve the selectivity of hGH for the growth 
hormone receptor over the prolactin receptor. This can be achieved by first contacting the library of phagemid 

25 particles with immobilized prolactin receptor, eluting those with a low affinity (i.e. lower than wild type hGH) for 
the prolactin receptor and then contacting the low affinity prolactin 'binders* or non-binders wiih ihe 
immobilized growth hormone receptor, and selecting for high affinity growth hormone receptor binders. In this 
case an hGH mutant having a tower affinity for the prolactin receptor would have therapeutic utility even if the 
affinity for the growth hormone receptor were somewhat tower than that of wiid type hGH. This same sirategy 

30 may be employed to improve selectivity of a particular hormone or protein for its primary function receptor over 
its clearance receptor. 

in another embodiment of this invention, an improved substrate amino acid sequence can be obtained. 
These may be useful for making better "cut sites" for protein linkers, or for better protease 
substrates/inhibitors. In this embodiment, an irnrnobilizabie molecule (e.g. hQH-receptor, biotin-avidin, or one 
35 capable of covaient linkage with a matrix) is fused to gene III through a linker. The linker will preferably be from 3 
to 10 amino acids in length and will act as a substrate for a protease. A phagemid will be constructed as described 
above where the DNA encoding the linker region is randomly mutated to produce a randomized library of phagemid 
particles with different amino acid sequences at the linking site. The library cf phagemid particles are then 
immobilized on a matrix and exposed to a desired protease. Phagemid particles having preferred or better 



WO 92/09690 



20 



PCT/US91/09133 



subsiraie amino aeid sequences in the liner region for the desired protease will be eluted, first producing an 
enriched pool of phagemid particles encoding preferred linkers. These phagemid particles are then cycled several 
more limes to produce an enriched pool of particles encoding consense sequence® (see examples XIII and XIV). 
vlii. .-,^h to-me Va-tents and Methods Cf 'JSC 
5 The cloned gene for hGH has been expressed in a secreted form in tsasricna £2ia (Chang, C. N>, ei aJ., 

[1987] Gene 55. 189) and its DNA and amino acid sequence has been reported (Goeddel, etal. [l979]Natum2&L 
544; Gray ei ai., [13851 fiauaSa, 247). The present invention describes novel hGH. variants produced using the 
phagemid selection methods. Human growth hormone variants containing substitutions at positions 10, 14, 18, 21, 
167,171,172, 174,175,176,178 and 179 have been described. Those raving higher binding affinities are 

1 0 described in Tables Vii, Xiii and XIV. The amino acid nomenclature for describing the variants is shown below. 
Growth hormone variants may be administered and formulated in the same manner as regular growth hormone. The 
growth hormone variants of the present invention may be expressed in any recombinant system which is capable of 
expressing native or met hGH. 

Therapeutic formulations of hGH for therapeutic administration are prepared for storage by mixing 

1 5 hGH having the desired degree of purity with optional physiologically acceptable carriers, excipients, or 

stabilizers fRprninpfnrVs Pharmaceutical Sciences. 16th edition, Oscl, A., Ed., (1980)., in the form of lyophilized 
cake or aqueous solutions. Acceptable carriers, excipients or stabilizers are nontoxic to recipients at the dosages 
and concentrations employed, and include buffers such as phosphate, citrate, and other organic acids; antioxidants 
including ascorbic acid; «w molecular weight (less than about 1 0 residues) polypeptides; proteins, such as serum 

20 albumin, gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, 
glufamine, asparagine, arginine, or lysine; monosaccharides, disaccharides, and other carbohydrates including 
glucose, mannose, or dextrins; chelating agents such as EDTA; divalent metal ions such as zinc, cobalt or copper; 
sugar alcohols such as mamitol or sorbitol; salt-forming counterions such as sodium; and/or nonionic surfactants 
such 3S T W8Sn> piunjnics or polyethylene glycol (PEG). Formulations of the present invention may additionally 

25 contain a pharmaceufjcaiiy acceptable buffer, amino acid, bulking agent and/or non-ionic surfactant These include, 
for example, buffers, chelating agents, antioxidants, preservatives, cosolvents, and the like; specific examples of 
these could include, trimethylamaine salts (Tris buffer), and disodium edetate. The phagemids of the present 
invention may be used to produce quantities of the hGH variants free of the phage protein. To express hGH 
variants free of the gene III portion of the fusion, pS0643 and derivatives can simply be grown in a non- 

30 suppressor strain such as 16C9. In this case, the amber codon (TAG) leads to termination of translation, which 
yields free hormone, without the need for an independent DNA construction. The hGH variant is secreted from the 
host and may be isolated from the culture medium. 

One or more of the eighthGHamino acids F1Q, M14, H18, H21, P.167, D171.T175 and 1179 may bereped 
by any amino acid other than the one found in that position in naturally occurring hGH as indicated. Therefore, i , 2, 

35 3, 4, 5, 6,7, or all 8 of the indicated amino acids, F10, M14, H18, H21, R167, D171, T175 and 1179, maybe replaced 
by any of the other 19 amino acids out of the 20 amino acids listed below. In a preferred embodiment, all eight 
listed amino acids are replaced by another amino acid. The most preferred eight amino acids to be substituted are 
indicated in Table XIV in Example XII. 
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Ala (A) 
Arg (R) 

Asn (N) 
Asp (D) 
Cys(C) 
Gin (Q) 
Glu(E) 
G!y «G) 
His (H) 
lie (I) 
Leu (L) 
Lys (K) 
Met (M) 
Phe (F) 
Pro (P) 
Ser (S) 
Thr (T) 
Trp <W) 
Tyr (Y) 
Va! (V) 

The one letter hGH variant nomenclature first gives the hGH amino acid deleted, for example glutamate 1 79; then 
the amino acid inserted; for example, serine; resulting in (E1795S). 

EXAMPLES 

Without further description, it is believed that one of ordinary skill in the art can, using the preceding 
description and illustrative examples, make and utilize the present invention to the fullest extent. The following 
working examples therefore specifically point out preferred embodiments of the present invention, and are not to 
be construed as limiting in any way of the remainder of the disclosure. 

EXAMPLE i 

Plasmid Constructions and Preparation of hGH-phagemid Particles 

The plasmid phGH-M13glll (Fig. 1), was constructed from M13K07 7 and the hGH producing plasmid, 
pB0473 (Cunningham, 8. C, era/. , Science . 243:1330-1335. }i989j). A synthetic oligonucleotide 5-AGC- 
TGT-GGC-TTC- GGG-CCC -TTA-GCA-TTT-AAT-GCG-GTA-3' was used to introduce a unique Apa\ 
restriction site (underlined) into pB0473 after the final Pheigi codon of hGH. The oligonucleotide 5'-TTC- 
ACA-AAC-GAA- GGG-CCC-C TA-ATT-AAA-GCC-AGA-3' was used to introduce a unique Apa\ restriction site 
(underlined), and a Glul97-to-amber stop codon (bold lettering) into M13K07 gene III. The oligonucleotide 5'- 
CAA-TAA-TAA-CGG -GCT-AGC -CAA-AAG-AAC-TGG-3 1 introduces a unique Atfjel site (underlined) after the 
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3- end of the gene iii coding sequence. The resulting 650 base pair (bp) Apa\-Nhe\ fragment from the doubly 
mutated M13K07 gene III was cloned into the large Ajoal-Aftel fragment of pB0473 to create the piasmid, 
pS0132. This fuses the carboxyl terminus of UGH (Phe191) to the Pro198 residue of the gene III protein with the 
insertion of a glycine residue encoded from the Apa\ site and places the fusion protein under control of the £ coH 
5 a itelirerix)sphatase(prwA)rxomoterarristllsec^ 

, 1S87]) For inducible expression of the fusion protein in rich media, we replaced the phoA promoterwith the lac 
promoter and operator. A 138 bp tcoRi-Abai fragment containing the lac promoter, operator, and Gap binding 
site was produced by PCR of piasmid pUCl 19 using the oligonucleotides 5'- 

CACGACAfMoHQCGGAGTGGAAA-3' and S'-CTGTT TCTAGAGTGAAATTGTTA-3' that flank the desired 

1 0 lac sequences and introduce the EcoRi and Xbai restriction sites (underlined). This lac fragment was gel purified 
and ligated into the large EcoRI-Xbal fragment of pS0132 to create the piasmid, phGH-M13glll. The sequences 
of all tailored DNA junctions were verified by the dideoxy sequence method (Sanger, F, etal. Proff, Natl, Acad, 
Rra U.S.A. 74:5463-5467. [1977]). Tne R64A variani hGK phagemid was constructed as follows: the Nsi!-Bg!N 
mutated fragment of hGH (Cunninghamef al. supra ) encoding the Arg64 to Ala substitution (R64A) 

1 5 (Cunningham, B. C, Wells, J. A., £cj2Q££, 244:1081-1085, [1989]) was cloned between the corresponding 
restriction sites in the phGH-Ml3giii piasmid (Fig. 1) to replace the wild-type hGH sequence. The R64A hGH 
phagemid particles were propagated and titered as described below for the wild-type hGH-phagemid. 

Piasmids were transformed into a male strain of £ coS (JM101) and selected on carbenicillin plates. A 
single transformant was grown in 2 mi 2YT medium for 4 h at 37"C and infected with 50 pi of M13K07 helper 

20 phage. The infected culture was diluted into 30 ml 2YT, grown overnight, and phagemid particles were harvested 
by precipitation with polyethylene glycol (Vlsrra, J., Messing, J. Methods in Enzvmoloov, 153:3-1 1 , [1987]). 
Typical phagemid particle titers ranged from 2 to 5 x 10 ■ 1 cfu/mi. The partieies were purified to homogeneity by 
CsCI density centrifugation (Day, LA -i MaL Biol.. 39265-277, [1969]) to remove any fusion protein not 
attached to virions. 

25 EXAMPLE ii 

irnwxxiierricatArarysesrthGHOT 
Rabbit poiycionai antibodies to hGH were purified with protein A, and coated onto microliter plates 
(Nunc) at a concentration of 2 ng/ml in 50 mM sodium carbonate buffer (pH 10) at 4'C for 16-20 hours. After 
washing in PBS containing 0.05% Tween 20, hGH or hGH-phagemid particles were serially diluted from 2.0 - 

3 0 0.002 nM in buffer A (50 mM Tris (pH 7.5), 50 mM NaCL 2 mM EDTA, 5 mg/m! bovine serum albumin, and 0.05% 
Tween 20). After 2 hours at room temperature (rt), the plates were washed well and the indicated Mab 
(Gunninghamer al supra ) was added at 1 ug/ml in buffer A for 2 hours at rt. Following washing, horseradish 
peroxidase conjugaied goat anti-mouse IgG antibody was bound at rt for 1 hour. After a final wash, the 
peroxidase activity was assayed with the substrate, oHJhenyienediamine. 

3 5 EXAMPLE IV 

Coupling of the hGH Birring Protein to Pdpcrylarnkfe Beads and Binding Enrichments 
Oxirane polyacrylamide beads (Sigma) were conjugated to the purified extracellular domain of the hGH 
receptor (hGHbp) (Fuh, G., eta!., J. Blot. Chem.. 265:3111-3115 [19901) containing an extra cysteine residue 
introduced by site-directed mutagenesis at position 237 ttiat does not affect binding of hGH (J. Wells, 
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unoobiished). ThehGHbp was conjugated mmmmM»U&**™™P*"*™*> 
HockedwimBSAandTri^ 

couoled to the beads. Buffer for adsorption and washing contained 10 mM Tns HCI (pH 7.5), 1 mM EDTA, 50 mM 
_ NaCi1m ^ 

glycine (pH 2.1). Parental phage Mi 3K07 was mixed with hQH phagemid particles at a ratio of nearly 3000:1 
(cnaina, mixture) and tumtted for 8-1 2 h with a 5 ui aliquot (0.2 mg of a^arr,kJe beads) of either absor-nt in a 
- vo . ume at room tempera ture. The beads were pelleted by centrifugal and the supernate care.u.,y 

removed ^^mmM*^*™*^^*^*™"^"**™ 
,n ^1) Anerasecondwash(wash2Mhebeads^ 

ieiuatei e!uate2). The final elution was with a glycine buffer (pH 2.1) for 4 hours to remove remaining hGH 
phagemid particles (eluate 3). Each fraction was diluted appropriately in 2YT meda, mixed with fresh JM101 . 
incubated at 37'C for 5 minutes, and plated with 3 ml of 2YT soft agar on LB or LB carbenicillin plates. 
EXAMPLE IV 

15 construction ot hGH-phagertd Particles with s Mixture of Gene !« Products 

The oene III orotein is composed of 410 residues divided into two domains thai are separated by a 
ftexiae linker s^uence (Armstrong, J., et a,.. EESiiilL 135:167-172, [1981])- The amino-terminal domain ,s 
required for attachment to the pill of £ coif, while the carboxyf-termina. domain is imbedded in the phage coat and 
^^.nrn.oerohaoeassemWy (Crissman, J. W., Smith, G. P., WW, 132:445-455.(1364]). Tr* rigna. 
20 sequerxsandami^^ 
s^a) by tuston forest 

under control of the lac promoter/operator in a plasmid (phGH-M13gill; Fig. 1) containing it* pBRSB 
B-iactamase gene and Col E1 replfcation origin, and the oha«fl inlergenfc region. The vector can be ea^ly 

malntoirnriasasmai.^^ 
25 for propa^n amatively, the tfasmid can be efficiently packaged into virions (caited phagemid particles) by 
infection with helper phage such as M13K07 (Vieraef a/., s^) which avoids problems of pnage assembly. 
Phagemid infectivity titers based upon transduction to carbenicillin resistance in this system varied from 2-5 x 
10 11cok)n y forming units (cfuyml. The titer of theM13K07 helper phage in these phagemki stocks is -10 - 
plaque forming units (pfu)/'m!. 

30 With this system we confirmed previous studies (Parmiey, Smith supra) that homogeneous expression of 

— ^ t-d to oene 111 is deteterious to phage production (data not shown). For exampie, induction of the 
.acpromoterinphGH-MiaglllbyadditionoflrnGprodu^ Moreover, phagemid partcles 

produced by co-infecttonwithM13K07 containing an amber mutation in ger^ III gave very low phagemid titers 
, <10 10 cWm!) . we believed that mulfipte copies of the gene III fusion attached to the phagemid surtace couio 

35 lead to multiple point attachment (the "chelate effect") of the fusion phage to the immobilized target protein. 
Therefore to control the Won protein copy number we limited transcription of the hGH-gene III fusion by 
eu|faring{ ^^ 

The E coli JM101 cultures containing P hGH-M13glll -were best propagated and infected with M13K07 in the 
absence of the lacoperon inducer (IPTG); however, this system is ftexible so that co-expression of other gene III 
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fusion proteins can be balanced. We estimate that about 10% of the phagemid partides contain one copy of the 
hGH gene !!! fusion protein from the ratio of the amount of hGH per virion (based on hGH immuno-reactive material 
in CsCI gradient purified phagemid). Therefore, the titer of fusion phage displaying the hGH gene III fusion is 
about 2 - 5 x 10 10 /ml. This number is much greater than the titer of £ col! (~1o8 to 10 9 /m!) in the culture from 
5 which they are derived. Thus, on average every £ coir ceii produces 10-100 copies of phage decorated with an 
hGH gene III fusion protein. 

EXAMPLE V 

Structural integrity of the hGH-gane ii Fusion 
ImmunoWot analysis (Fig. 2) of the hGH-gene III phagemid show that hGH cross-reactive material 
1 0 emigrates with phagemid particles in agarose gels. This indicates that the hGH is tightly associated with 

phagemid particles. The hGH-gene III fusion protein from the phagemid partides runs as a single immuno-stained 
band showing that there is little degradation of the hGH when it is attached to gene III. Wild-type gene III protein 
is clearly present because about 25% of the phagemid partides are infectious. This is comparable to specific 
infectiviry estimates made for wild-type M13 phage thai are similarly purified (by CsCI density gradients) and 
1 5 concentrations estimated by UV absorbance (Smith, G. P. supra and Parmley, Smith supra) Thus, both wild-type 
gene III and the hGH-gene III fusion proteins are displayed in the phage pod. 

ii was important to confirm thai ihe tertiary structure oi the displayed hGH was maintained in order to 
have confidence that results from binding selections will translate to the native protein. We used monodonal 
antibodies (Mate) to hGH to evaluate the structural integrity of the displayed hGH gene HI fusion protein (Table 
20 I). 

TABLE L Binding of Eight Dffferent Mooocicnai 
Antibodies (Mab's) to hGH and hGH Phagemid Parades* 



25 



Mab 


ICso (nM) 




hGH 


hGH-phagemid 


1 


0.4 


0.4 


2 


0.04 


0.04 


3 


02 


0.2 


4 


O.i 


0.1 


5 


0.2 


>2.0 


6 


0.07 


0.2 


7 


0.1 


0.1 


8 


0.1 


0.1 



vaiues given represent me concentration (nM) of hGH or hGH-phagemid particles to give half-maximal binding to 
the particular .Mab. Standard errors in these measurements are typically at or below ±30% of the reported value 
See Materials and Methods for further detaiis. hwwuv«uub. 

3 0 The epitopes on hGH for these Mate nave been mapped (Cunningham ef a/., supra) and binding for 7 of 

8 Mabs requires that hGH be properly folded. The IC50 values for all Mabs were equivalent to wild-type hGH 
except for Mab 5 and 6 . Both Mabs 5 and 6 a'e known to have binding determinants near the carboxyi-ierminus of 
hGH which is blocked in the gene ill fusion protein. The relative 1C50 value for Mabl which reacts with both native 
and denatured hGH is unchanged compared to the conformafJonally sensitive Mabs 2-5, 7 and 8. Thus, Mabl 

35 serves as a good internal control for any errors in matching the concentration of the hGH standard to thai of ihe 
hGH-gene iii fusion. 
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EXAMPLE V! 
Binding Enrichments on Rec e pto r Affinity Beads 

Previous workers (Parmley, Smith supra ; Scott, Smith supra; Cwirla etal. supra; and Devlin era/. 
5 supra) have fractionated phage by panning with strsptavidin coated polystyrene petri dishes or microiiter plates. 
However, chromatographic systems would allow more efficient fractionation of phagemid particles displaying 
mutant proteins with different binding affinities. We chose non-porous oxirane beads (Sigma) to avoid trapping 
of phagemid particles in the chromatographic resin. Furthermore, these beads have a small particle size (1 jim) to 
maximize the surface area to mass ratio. The extracellular domain of the hGH receptor (hGHbp) (Fuh etal. , 
1 0 supra) containing a free cysteino residue was efficiently coupled to these beads and phagemid particles showed 
very low non-specific binding to beads coupled only to bovine serum albumin (Table I!}. 

TABLE II. 



15 Specific Binding cf Hormone Phage to hGHbp-coaied 

Beads Provides an Enrichment for hGH-phage over M13K07 Phage* 





Sample Absorbent* 


Total pfu 


Total cfu 


Ratio (cfu/pfu) 


Enrichment§ 


20 


Original mixturet 


8.3 x 10 11 


2.9 x 10 8 


3.5 x 10" 4 


(1) 




Supernatant BSA 


7.4 x 10 11 


2.8 x 10 8 


3.8 x 10" 4 


1.1 




hGHbp 


7.6 x10 11 


3.3 x10 8 


4.3 x 10- 4 


1.2 




Wash 1 BSA 


1.1 x 10 10 


6.0 x 10 6 


5.5 x 10" 4 


1.6 




hGHbp 


1.9 x 10 10 


1.7 x 10 7 


8.9 x 10' 4 


2.5 


25 


Wash 2 BSA 


5.9 x 10 7 


2.8 x 10 4 


4.7 x 10' 4 


1.3 




hGHbp 


4.9 x 10' 


2.7 x 10 8 


5.5 x 10- 2 


1.6 x 1fj2 




Eluate 1 (hGH)BSA 


1.1 x 10 s 


1.9 x 10 3 


1.7 x 10-3 


4.9 




hGHbp 


1.2 x 10 6 


2.1 x 10 6 


1.8 


5.1 x 10 3 




Eluate 2 (hGH)BSA 


5.9 x 10 5 


1.2 x 10 3 


2.0 x 10" 3 


5.7 


30 


hGHbp 


5.5 X10 5 


1.3 X 10 6 


2.4 


6.9 x 10 3 




Eluate 3 (pH 2.1) BSA 


4.6 x 10 5 


2.0 x 10 3 


4.3 x ID' 3 


12.3 




hGHbp 


3.8 x 10 5 


4.0 X10 6 


10.5 


3.0 xlO 4 



*The titers of M13K07 and hGH-phagemid particles in each fraction was determined 
35 by multiplying the number of plaque forming units (pfu) or carbenicillin resistant 
colony forming units (cfu) by the dilution, factor, respectively. See Example IV for 
details. 

TThe ratio of M13K07 to hGH-phagemid particles was adjusted to 3000:1 in the 

original mixture, 
40 i Absorbents were conjugated with BSA or hGHbp, 

§Enrichments are calculated by dividing the cfu/pfu ratio after each step by cfu/pfu 

ratio in the original mixture. 

in a typical enrichment experiment (Table II), one part of hGH phagemid was mixed with >3,C00 parts 

M13K07 phage. After one cycle of binding and elutJon, 10 6 phage were recovered and the ratio of phagemid to 
45 M13K07 phage was 2 to 1 . Thus, a single binding selection step gave >5000-fold enrichment. Additional elutions 

with free hGH or acid treatment to remove remaining phagernids produced even greater enrichments. The 

enrichments are comparable to those obtained by Smith and coworkers using batch elution from coated 

polystyrene plates (Smith, G.P. supra and Parmely, Smith supra ) however much smaller volumes are used on the 
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beads (200 til vs. 6 mi). There was almost no enrichment for the hGH phagemid over M13K07 when we used 
beads linked only to BSA. The slight enrichment observed for control beads (~iO-foid for pH 2.1 eiuiion; Table 
2) may result from trace contaminants of bovine growth hormone binding protein present in the BSA linked to the 
bead. Nevertheless these data show ihe enrichments for the hGH phage depend upon the presence of the hGHbp 
5 on me bead suggesting binding occurs by specific interaction between hGH and the hGHbp. 

We evaluated the enrichment for wild-type hGH over a weaker binding variant of the hGH on fusion 
phagemids to further demonstrate enrichment speaiieity, and to link the reduction in binding affinity for the 
purified hormones to enrichment factors after panning fusion phagemids. A fusion phagemid was constructed 
with an hGH mutant in which Arg64 was substituted with Aia (R64A). The R64A variant hormone is about 20- 
1 0 fold reduced in receptor binding affinity compared to hGH (Kd values of 7.1 nM and 0.34 nM, respectively 

[Cunningham, Wells, supra ]). The filers of the R64A hGH-gene III fusion phagemid were comparable to those of 
wild-type hGH phagemid. After one round of binding and eiuiion (Table III) the wild-type hGH phagemid was 
enriched from a mixture of the two phagemids pius M13K07 by 8-fold relative to the phagemid R64A, and -10+ 
relative to M13K07 helper phage. 

' 5 TABLE III. hGHbjKoated Beads Select for hGH Phagemids 

Over a Weaker Binding hGH Variant Phagemid 



f^ntml heads 



Sample wTphanemid enrichment VVT Bf)33SffliJ 

ioiai phagemid for WT/P.64A total phagemid forWT/R64A 



Original mixture 
Supernatant ND 

Elutionl(hGH) 7/20 0.8 17/20 



8/20 (1) 
4/10 1-0 



8.5* 

Elution2(pH2.1) 11/20 1.8 21/27 



'The parent M13K07 phage, wild-type hGH phagemid and R54A phagemid particles were mixed at a ratio of 
i0 4 0 AH 6 Binding selections were carried out using beads linked with BSA (control beads) or with the hGHbp 
{hGHbp beads) as described in Table I! and the Materials and Methods After each step, plasmid DNA was 
so!ated(Bimboim, H. C, Doly, J. , Nucleic Acids RgS„ 7:1513-1523, [1979]) ^ m «^ ni *^fa ^~>°nies 
and analyzed by restriction analysis to determine if it contained the wild-type hGH or the R64A hGH gene ill 
fusion. 

TThe enrichment for wild-type hGH phagemid over R64A mutant was calculated from the ratio of hGH phagemid 
preVenl Each step tothai present in ihe original mixture (8,-20), divided by ihe corresponding ratio for 
RB4A phagerr.ids. WT = wild-type; ND = not determined. 

*The enrichment for phagemid over total M13K07 parental phage was ~10 4 after this step. 



By displaying a mixture of wild-type gene III and the gene III fusion protein on phagemid particles one can 
assemble and propagate virions that display a large and proper folded protein as a fusion to gene III. The copy 
number of the gene III fusion protein can be effectively controlled to avoid 'chelate effects" yet maintained at high 
enough levels in the phagemid pool to permit panning of large epitope libraries (>10 10 ). We have shown thathGH 
(a 22 kD protein) can be displayed in its native folded form. Binding selections performed on receptor affinity 
beads eluted with free hGH, efficiently enriched for wild-type hGH phagemids over a mutant hGH phagemid shown 
to have reduced receptor binding affinity. Thus, it is possible to sort phagemid particles whose binding constants 
are down in the nanomolar range. 
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Protein-protein and antibody-antigen interactions are dominated by discontinuous epitopes (Janin, J., 

eiai. , Mol BifiL 204:155-164, [1388]; Argcs, P., Prat. Eng.. 2:101-113, [1983]; Bartow, D.J.,efa'. , NalUffi. 

322:747-748, [1987]; and Davies, D.R., etai. , J. Biol, chem.. 2 63:10541-10544. [1388]); that is the residues 

directly involved in binding are close in tertiary structure but separated by residues not involved in binding. The 

5 screening system presented here should allow one to analyze more conveniently protein-receptor interactions and 

isolate discontinuous epitopes in proteins with new and high affinity binding properties. 

EXAMPLE VII 
Selection of hGH Mutants from a Library Randomized at 
hGH Codons 172, 174, 176, 178 

10 Construction of template 

A mutant of the hGH-gene III fusion protein was constructed using the method of KunkeL.ef al. Meth. 
Enzyme!. 154, 367-382 [1987]. Template DNA was prepared by growing the plasmid pS0132 (containing the 
natural hGH gene fused to the carboxy-ierminai naif of M13 gene iii, under control or the aikaiine phosphatase 
promoter) in CJ236 cells with M13-K07 phage added as helper. Single-stranded, uracil-containing DNA was 
1 5 prepared for mutagenesis to introduce {1 ) a mutation in hGH which would greatly reduce binding to the hGH 
binding protein (hGHbp); and (2) a unique restriction site (Kpni) which couid be used for assaying for - and 
selecting against - parental background phage. Oligorwdeofide-directed mutagenesis was carried out using T7 
DNA polymerase and the following cligcdeoxy-nucleotide: 

Gly Thr 

20 hGH codon: 178 179 

5'-G ACA TTC CTG SGI A£C GTG CAG T-3' 
< Kpni > 

This oligo introduces the Kpni site as shown, along with mutations (R178G, I179T) in hGH. These mutations are 
predicted to reduce binding of hGH to hGHbp by mere than 30-fokj. Clones from the mutagenesis were screened 
25 by Kpni digestion and confirmed by dideoxy DNA sequencing. The resulting construct, to be used as a template 
for random mutagenesis, was designated pH04l5. 
Random rraisasnssis within hsi!x4 of hGH 

Codons 172, 174, 176, 178 were targeted for random mutagenesis in hGH, again using the method of 
Kunkel. Single-stranded template from pH0415 was prepared as above and mutagenesis was carried out using 

3 0 the following pool of oiiges: 

hGH codon: 172 174 

5'- GC TTC AGG AAG GAC ATG GAC HNS. GTC UNS. ACA- 

35 He 
176 178 179 

- NNS CTG WWS AXC GTG CAG TGC CGC TCT GTG G-3 ' 

As shown, this oligo pool reverts codon 179 to wild-type (lie), destroys the unique Kpni site of pH0415, and 
introduces random codons (NNS, where N= A,G,C, or T and S= G or C) at positions 1 72,1 74,1 76, and 1 78. Using 

4 0 this codon selection in the context of the above sequence, no additional Kpn! sites can be created. The choice of the 

NNS degenerate sequence yields 32 possible codons (including one "stop" codon, and at least one codon for each 
amino acid) at 4 sites, for a total of (32) 4 = 1,048,576 possible nucleotide sequences (12% of which contain at 
least one stop codon), or (20) 4 = 160,000 possible polypeptide sequences plus 34,481 prematurely terminated 
sequences (i.e. sequences containing at least one stop codon). 
45 Propagation of the Initial library 
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The mutagenesis products were extracted mice with pheraishtofoforrn (50:50) and ethane! 
precipitated with an excess of carrier tRNA to avoid adding salt that would confound the subsequent 
electroporation step. Approximately 50 ng (15 fmois) of DNA was electroporated into WJM101 cells (2.8 x 10 1 0 
cells/mL) in 45 pi total voiume in a 0.2 cm cuvette at a voltage setting of 2.43 kV with a single pulse (time 
5 constant = 4.7 msec.). 

The ceils were allowed to recover 1 hour at 37°C with shaking, then mixed with 25 mL 2YT medium, 1 00 
u.g/mL carhenidiiin, and M13-K07 (multiplicity of infection = 1000). Plating of serial dilutions from this culture 
onto carbenicillin-containing media indicated that 8.2 x 10 6 electrotransformants were obtained. After 10' at 
23°C, the culture was incubated overnight (15 hours) at 37°C with snaking. 

1 o After overnight incubation, the ceils were peiieied, and double-stranded DNA (dsDNA), designated 

pLIB1 , was prepared by the alkaline lysis method. The supernatant was spun again to remove any remaining cells, 
and the phage, designated phage poo! *1 , were PEG-predpitated and resuspended in 1 mL STE buffer (10 mM 
Tris, pH 7.6, 1 mM EDTA, 50 mM NaCI). Phage titers were measured as colony-forming units (CFU) for the 
recombinant phagemid containing hGH-g3p gene III fusion (hGH-g 3 ) plasmid, and plaque-forming units (PFU) for 

15 the M13-KC7 helper phage. 

1. BINDING: An aliquot of phage pool <J>1 (6x 10 9 CFU, 6 x10 7 PFU) was diluted 4.5-fold in buffer A 
(Phosphate-buffered saline, 0.5% BSA, 0.05% Tween-20, 0.01% thimerosal) and mixed with a 5 ul suspension of 
oxirane-polyacrylamide beads coupled to the hGHbp containing a Ser237 Cys muianon (350 frnois) in a 1.5 mL 

20 silated polypropylene tube. As a control, an equivalent aliquot of phage were mixed in a separate tube with beads 
that had been coated with BSA only. The phage were allowed to bind to the beads by incubating 3 hours at room 
temperature (23°C) with slow rotation (approximately 7 RPM). Subsequent steps were carried out with a 
constant volume of 200ul and at room temperature. 

2. WASH: The beads were spun 15 sec., and the supernatant was removed (Sup. 1). To remove 

25 phage/phagemid not specifically bound, the beads were washed twice by resuspending in buffer A, then pelleting. 
A final wash consisted of rotating the beads in buffer A for 2 hours. 

3. hGH ELUTION: Phage/phagemid binding weakly to the beads were removed by stepwise elufjon with 
hGH. In the first step, the beads were rotated with buffer A containing 2 nM hGH. After 17 hours , the beads 
were pelleted and resuspended in buffer A containing 20 nM hGH and rotated for 3 hours, then pelleted. In the 

3 0 final hGH wash, the beads were suspended in buffer A containing 200 nM hGH and rotated for 3 hours then 
pelleted. 

4. GLYCINE ELUTION: To remove the tightest-binding phagemid (i.e. those still bound after the hGH 
washes), beads were suspended in Glycine buffer (1 LLGIycine, pH 2.0 with HC1), rotated 2 hours and pelleted. 
The supernatant (fraction "G"; 200ul) was neutralized by adding 30 ui of 1 M Tris base. 

35 Fraction G eiuted from the hGHbp-beads (1 x 10 6 CFU, 5 x 10 4 PFU) was not substantially enriched 

for phagemid over K07 helper phage. We believe this resulted from the fact that K07 phage packaged during 
propagation of the recombinant phagemid display the hGH-g3p fusion. 
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However, when compared with fraction G eluted from the BSA-coated control beads, the hGHbp-beads 
yielded 14 times as many CFU's. This reflects the enrichment of tight-binding hGH-displaying phagemid over 
nonspectncaiiy-binciing phagemid. 

5. PROPAGATION: An aliquot (4.3 x 10 s CPU) of fraction G eluted from the hGHbp-beads was used 
5 to infect log-phase WJM101 cells. Transductions were carried out by mixing 100 ui fraction G with 1 mL WJM101 
ceils, incubating 20 min. at 37©C, then adding K07 (muiiipiiciiy of infeciion= 1000). Cuiiures (25 mL 2YT pius 
carbenicillin) were grown as described above and the second pool of phage (Library 1G, for first glycine elution) 
were prepared as described above. 

Phage from library 1 G (Fig. 3) were selected for binding to hGHbp beads as described above. Fraction 
1 0 G eluted from hGHbp beads contained 30 times as many CFU's as fraction G eluted from BSA-beads in this 
selection. Again, an aliquot of fraction G was propagated in WJM101 cells to yield library 1G 2 (indicating that 
this library had been twice selected by glycine elution). Double-stranded DNA (pLIB 1G 2 ) was also prepared 
from this culture. 

1 5 To reduce the level of background (Kpnl+) template, an aliquot (about 0.5 ng) of pLIB 1 G 2 was 

digested with Kpnl and electroporated into WJM101 cells. These cells were grown in the presence of K07 
(multiplicity of infection^ 100) as described for the initial library, and a new phage pool, pLIB 3, was prepared 
(Fig. 3). 

In addition, an aliquot (about 0.5 u.g) of dsDNA from the initial library (pLIBl) was digested with Kpnl 
20 and electroporated directly into WJM101 cells. Transformants were allowed to recover as above, infected with 
M13-K07, and grown overnight to obtain a new library of phage, designated phage Library 2 (Fig. 3). 



Phagemid binding, elution, and propagation were carried out in successive rounds for phagemid derived 
from both pLIB 2 and pLIB 3 (Fig. 3) as described above, except that (1) an excess (10-fold over CFU) of 
2 5 purified K07 phage (not displaying hGH) was added in the bead-binding cocktail, and (2) the hGH stepwise 
eiuuons were replaced with brief washings of buffer A alone. Also, in some cases, XL1 -Blue ceils were used for 
phagemid propagation. 

An additional digestion of dsDNA with Kpnl was carried out on pLIB 2G3 and on pLIB 3G 5 before the 
final round of bead-binding selection (Fig. 3). 

Four independently isolated clones from LIB 4G 4 and four independently isolated clones from LIB 5G 6 
were sequenced by dideoxy sequencing. Aii eight of these dones had identical DNA sequences: 

hGH codon: 172 174 176 178 

35 5' -AAG GTC TCC ACA TAC CTG AGG ATC-3' 

Thus, all these encode the same mutant of hGH: (E174S, F176Y). Residue 172 in these clones is Lys as in wild- 
type. The codon selected for 172 is also identical to wild-type hGH. This is not surprising since AAG is the only 
iysine-codon possible from a degenerate °NNS = codon set Residue 178-Arg is aiso the same as wiid-type, but 
here, the codon selected from the library was AAG instead of CGC as is found in wild-type hGH, even though the 
4 0 latter codon Is also possible using the "N NS" codon set. 
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MultltflcityofKOTlr 

The multiplicity of infection of K07 infection is an important parameter in the propagation of 
recombinant phagemids. The K07 multiplicity of infection must be high enough to insure that virtually all cells 
transformed or transfected with phagemid are able to package new phagemid particles. Furthermore, the 
concentration of wild-type gene III in each cell should be kept high to reduce the possibility of multiple hGH-gene III 
fusion molecules being displayed on each phagemid particle, thereby reducing chelate effects in binding. However, 
if the K07 multiplicity of infection is too high, ihe packaging of K07 will compete with that of recombinant 
phagemid. We find that acceptable phagemid yields, with only 1 -10% background K07 phage, are obtained when 
ihe K07 multiplicity of infection is 10Q. 

Tsfcte IV. 

hGHbp/BSA beads Fraction Kpnl 

14 0.44 
30 0.57 
1.7 0.26 
8.5 0.18 
220 0.13 

15 ND 

1.7 <0.05 
4.1 <0.10 
27 0.18 
3B ND 

Phage pools are labelled as shown (Fig. 3). The multiplicity of infection (moi) refers to the multiplicity of K07 
infection (PFU/cells) in the propagation of phagemid. The enrichmeni of CFU over PFU is shown in those cases 
where purified K07 was added in the binding step. The ratio of CFU eluting from hGHbp-beads over CFU eluting 
from BSA-beads is shown. The fraction of Kpnl-contaireng template (i.e., pH0415) remaining in the pool was 
determined by digesting dsDNA with Kpnl plus EcoRI, running the products on a 1% agarose gel, and laser- 
scanning a negative of the ethidium bromide-stained DNA. 

The fact that a singledone was isolated from two different pathways of selection (Fig. 3) suggested 
mat the double mutant (E174S,F176Y) binds strongly to hGHbp. To determine the affinity of this mutant of 
hGK for hGHbp, we constructed this mutant of hGH by site-directed mutagenesis, using a plasmid (pBQ720) 
which contains the wild-type hGH gene as template and the following oligonucleotide which changes codons 174 and 
176: 

hGH codon: 172 174 176 178 

Lys Ser Tyr Arg 

5 r_ ATG GAC AAG GTS T£G ACA TAC CTG CGC ATC GTG -3' 



Phage Pool 


moi (K07) 


Enrichment 
CFU/PFU 


LIB1 


1000 


ND 


LIB 1G 


1000 


ND 


LIB 3 


100 


ND 


LIB 3G 3 


10 


ND 


LIB 3G 4 


100 


460 


LIB 5 


100 


ND 


LIB 2 


100 


ND 


LIB 2G 


10 


ND 


LIB 2G 2 


100 


1000 


LIB 4 


100 


170 



The resulting construct, pH0458B, was transformed into E. coli strain 16C9 for expression of the mutant 
30 hormone. Scafchard analysis of competitive binding of hGH(E174S ! F176Y) versus 12 5|-hGH to hGHbp indicated 
that the (E174S.F176Y) mutant has a binding affiniiy ai ieasi 5.0-foid tighter than that of wild-type hGH. 
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EXAMPLE VIII 

SELECTION OF hGH VARIANTS FROM A 
5 HEUX-4 RANDOM CASSETTE LIBRARY OF HORMONE-PHAGE 

Human growth hormone variants were produced by the method of the present invention using the 

phagemid described in figure 9. 

Construction of a dp-fi gable rtorrnone-ohaae vector 

We designed a vector for cassette mutagenesis (Wells et ai„ Gene 34, 315-323 [1985]) and expression 

10 of the hGH-gene iii fusion protein with the objectives of (1) improving the iinkage between hGH and the gene ill 
moiety to more favorably display the hGH moiety on the phage (2) limiting expression of the fusion protein to 
obtain essentially "monovalent display,' (3) allowing for restriction nuclease selection against the starting vector, 
(4) eliminating expression of fusion protein from the starting vector, and (5) achieving fadie expression of the 
corresponding free hormone from a given hGH-gene ill fusion mutant. 

1 5 Plasmid pS0643 was constructed by oligcnuclectide-directed mutagenesis (Kunke! et a!., Methods 

Enzymol. 154. 367-382 [1987]) of pS0132, which contains pBR322 and fl origins of replication and expresses an 
hGH-gene III fusion protein (hGH residues 1-191, followed by a single Gly residue, fused to Pro-198 of gene III) 
under the control of the E. coil phoA promoter (Bass et a!., Proteins 8, 309-314 |199Q])(Figure 9). Mutagenesis 
was carried out with the oligonucleotide S'-ggc-AGC-TGT-gGC-T TC-TAG-AG T-GGC-GGC-ggC-TCT- 

2 0 GGT-3', which introduces a Xbal site (underlined) and an amber stop codon (TAG) following Phe-1 9 1 of hGH. In 

the resulting construct, pS0643, a portion of gene II! was deleted, and two silent mutations (underlined) 
occurred, yielding the following junction between hGH and gene III: 

-- hGH ,„,„„„,„„,„_> geae m °> 

25 187 188 189 190 191 am* 249 2SO 251 252 253 254 

Gly Ser Cys Gty Phe Ghi Ser Gly Gty Gty Ser Cfy 

GGC ACC TGT GGA TIC TAG AGT QCQ GGT GGC TCT GGT 

This shortens the total size of the fusion protein from 401 residues in pS0132 to 350 residues in 

3 0 pS0643. Experiments using monoclonal antibodies against hGH have demonstrated that the hGH portion of the 

new fusion protein, assembled on a phage particle, is more accessible than was the previous, longer fusion. 

For propagation of hormone-displaying phage, pS0643 and derivatives can be grown in a amber- 
suppressor strain of E. cell, such as JM101 or XL1 -Blue (Bullock et a!., BioTechniques 5, 376-379 [1987]). Shown 
above is substitution of Glu at the amber codon which occurs in sup£ suppressor strains. Suppression with other 
35 amino acids is also possible in various available strains of E.coli well known and pubiically available. 

To express hGH (or mutants) free of the gene III portion of the fusion, pS0643 and derivatives can 
simply be grown in a non-suppressor strain such as 16C9. In this case, the amber codon (TAG) leads to 
termination of translation, which yields free hormone, without the need for an independent DNA construction. 

To create sites for cassette mutagenesis, pS0843 was mutated with the oligonucleotides (1) 5-CGG- 
40 ACT-GGG-C AG-ATA-TT C-AAG-CAG-ACC-3'. which destroys the unique Ml site of pS0643; (2) 5'-CTC- 
AAG-AAC-TAC-G GG-TTA-CCC -TGA-CTG-CTT-CAG-GAA-GG-3'. which inserts a unique BsO site, a 
single-base frameshitt, and a non-amber stop codon (TGA); and (3) 5'-CGC-ATC-GTG-CAG-TGC -AGA-TCT - 
GTG-GAG-GGC-3', which introduces a new figlll site, to yield the starting vector, pH0509. The addition of a 
frameshift along with a TGA stop codon insures that no genelll-fusion can be produced from the starting vector. 
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The EsO-BflIU segment is cut out of pH0509 and replaced wiih a DNA cassette, mutated at the codons of 
interest. Other restriction sftes for cassette mutagenesis at other locations in hGH have also been introduced into 
the hormoneiJhage vector. 

CfPrffnffmiiTW^ s ^ 1hInhelly4ofhGH 

5 Codons 172, 174, 176 and 178 of hGH were targeted for random mutagenesis because they all lie on or 

near the surface of hGH and contribute significantly to receptor-binding (Cunningham and Wells, ScieO££ 244, 
1081-1085 [1989]); they all lie within a well-defined structure, occupying 2 turns' on the same side of helix 4; 
and they are each substituted by at least one amino add among known evolutionary variants of hGH. 

We chose to substitute NNS (N=A/G/C/T; S=G/G) at each of the target residues. The choice of the 
1 o NNS degenerate sequence yields 32 possible codons (including at least one codon for each amino acid) at 4 sites, 
for a total of (32)4= 1,048,576 possible nucleotide sequences, or (20) 4 = 160,000 possible polypeptide 
sequences. Only one stop codon, amber (TAG), is allowed by this die** of codons, and this codon is suppressive 
as GIu in sjjpj; strains of ELM- 

Two degenerate oligonucleotides, with NNS at codons 172, 174, 176, and 178, were synthesized, 
1 5 phosphoryiated, and annealed to construct the mutagenic cassette: S'-GT-TAC-TCT-ACT-GCT-TTC-AGG- 
AAG-GAC-ATG-GAC-NNS-GTC-NNS-ACA-NNS-CTG-NNS-ATC-GTG-CAG-TGC-A-3', and 5"-GA-TCT- 
QCA-CTG-CAC-GAT-SNN-CAG-SNN-TGT-SNN-GAC-SNN-GTC-CAT-GTC-CTT-CCT-GAA-GCA-GTA- 

GA-3 : . 

The vector was prepared by digesting pH0509 with fislfcii followed by Boiii- The products were run on 

20 a 1% agarose gel and the large fragment excised, phenol-extracted, and ethanol precipitated. This fragment was 
treated with calf intestinal phosphatase (Boehringer), then phenotchloroform extracted, ethanol precipitated, 
and resuspended for ligation with the mutagenic cassette. 
Prr-T*™ ■nman.hrsrvrn XL1-Blue cells 

Following ligation, the reaction products were again digested with £s£!l, then phenol :chloroform 

25 extracted, ethanol precipitated and resuspended in water. (A BsiEii recognition site (GGTNACC) is created 
within cassettes which contain a G at position 3 of codon 172 and an A££ (Thr) codon at 174. However, 
treatment with Mi at this step should rot select against any of the possible mutagenic cassettes, because 
virtually all cassettes will be heteroduplexes, which cannot be cleaved by the enzyme.) Approximately 150 ng (45 
fmois) of DNA was electroporated info XL1-Blue cells (1.8 x 1<J» cells in 0.045 mL) in a 02 cm cuvette at a 

3 0 voltage setting of 2.49 fcv wiih a single pulse (time constant = 4.7 msec.). 

The cells were allowed to recover 1 hour at 37°C in S.O.C media with shaking, then mixed with 25 mL 
2YT medium, 100 mg/rr.L carbenicillin, and M13-K07 (moi= 100). After 10" at 23°C, the culture was incubated 
overnight (15 hours) at 37°C wiih shaking. Rating of serial dilutions from this culture onto carbenicillin- 
containing media indicated that 3.9 x 10 7 electrotransformants were obtained. 

35 After overnight incubation, the cells were pelleted, and double-stranded DNA (dsDNA), designated 

pH0529E (the initial library), was prepared by the aikaiine iysis meihod. The supernatant was spun again to 
remove any remaining cells, and the phage, designated phage pool <t>H0529E (the initial library of phage), were 
FEG-precipitated and resuspended in 1 mL STE buffer (10 mM Tris, pH 7.6, 1 mM EDTA, 50 mM NaCI). Phage 
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titers were measured as colony-forming units (CFU) for the recombinant phagemid containing hGH-g3p. 
Approximateiy 4.5 x iO 13 CFU were obtained from the starting library. 



From the pool of electrotransformants, 58 clones were sequenced in the region of the BslEli-figlM 
5 cassette. Of these, 1 7% corresponded to the starting vector, 17% contained at least one frame shift, and 7% 
contained a non-silent (non-terminating) mutation outside the four target codons. We conclude that 41% of the 
denes were defective by one of the above measures, leaving a total functional pool of 2.0 x 10 7 initial 
transformants. This number still exceeds the possible number of DNA sequences by nearly 20-fold. Therefore, 
we are confident of having all possible sequences represented in the starting library. 
1 0 We examined the sequences of non-selected phage to evaluate the degree of codon bias in the 

mutagenesis (Table V). The results indicated rat, although some codons (and amino acids) are under- or over- 
represented relative to the random expectation, the library is extremely diverse, with no evidence of large-scale 
"sibling" degeneracy (Table VI). 

Table V. 

15 

Codon distribution (per 188 codons) of non-selected hormone phage. Clones were sequenced from the starting 
library (pH0529E). Aii codons were tabulated, including those from clones which contained spurious mutations 
and/or frameshifts. * Note: the amber stop codon (TAG) is suppressed as Glu in XL1-B!ue cells. Highlighted 
codons were over/under-represented by 50% or more. 

20 





Residue 




Number found 


Fpung/Expscisti 




Leu 


17.6 


18 


1.0 


25 


Ser 


17.6 


26 


1.5 




Arg 


17.6 


10 


0.57 




Pro 


11.S 


16 


1.4 




Thr 


11.8 


14 


1.2 


30 


Ala 


11.8 


13 


1.1 




Gly 


11.8 


16 


1.4 




Val 


11.8 


4 


0.3 




ie 


5.3 


2 


0.3 


35 


Me! 


5.9 


1 


0,2 




Tyr 


5.9 


1 


0.2 




His 


5.9 


2 


0.3 




Trp 


5.9 


2 


0.3 




Pre 


5.9 


5 


0.9 


40 


Cys 


5.S 


5 


0.9 




Gin 


5.9 


7 


1.2 




Asn 


5.9 


14 


2.4 




Lys 


5.9 


11 


1.9 




Asp 


5.9 


9 


1.5 


45 


Glu 


5.9 


6 


1.0 




amber* 


5.9 


6 


1.0 
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Table VI. 

SSSSSS— » *- <™» — • «— 
5 asGluinXLI-Biueceiisareshovwiase. 



KeNT 


ICTEQ 






NNCR 


EASL 


PeER 


FPCL 


SSKE 


LPPS 


NSDF 


ALLL 
PSHP 


SLOP 


HRPS 


QQ3N 
GSKT 


LSLe 


SYAP 


NGSK 


ASNG 


TPVT 
RSRA 


LITE 


EANN 


PSGG 


KNAK 


LCGL 


LWFP 


SRGK 


TGRL 


PAGS 


GLDG 


AKAS 


GRAK 


NOP! 


GNDD 


GTNG 





Pr^ratinn of Immobilized hfiHhP and hPRLbo 

Immobilized hGHbp ChGHbp-beads") was prepared as described (Bass et al, Pi2leiDS8, 309-314 
[1390]), except that wild-type hGHbp {Fuh et aL. J Biol. Chem. 265, 3111-3115 [1990]) was used. Competitive 
25 binding experiments with [1 25 i] hGH indicated thai 58 fmois of functional hGHbp were coupled per m o? bead 
suspension. 

Immobilized hPRLbp ("hPRLbp-beads") was prepared as above, using the 21 1 -residue extracellular 
domain of the prolactin receptor (Cunningham et al.. Science 250. 17(39-1712 [1990]). Competitive binding 
experiments with [1 25|J hGH in the presence of 50 mM zinc indicated that 2.1 fmols of functional hPRLbp were 
30 coupled per ul. of bead suspension. 

■Blank beads' were prepared by treating the oxirane-acryiamide beads with 0.6 M ethanolamine (pH 
9.2) for 15 hours at 4°C. 

Binding of hormone-phage to beads was carried out in one of the following buffers: Buffer A (PBS, 
35 0.5% BSA, 0.05% Tween 20, 0.01% thimerosal) for selections using hGHbp and blank beads; Buffer B (50 mM 
iris pH 7.5, 10 mM MgCfe 0.5% BSA, 0.05% Tween 20, 100 mM ZnCIo) for selections using hPRLbp in the 
presence of zinc (+ Zn'2+); or Buffer C (PBS, 0.5% BSA, 0.05% Tween 20, 0.01% ihimerosai, 10 mM EDTA) for 
selections using hPRLbp in the absence of zinc (+ EDTA). Binding selections were carried out according to each 
of the following paths: (1) binding to blank beads, (2) binding to hGHbp-beads, (3) binding to hPRLbp-beads (+ 
40 Zn2+), (4) binding to hPRLbp-beads (+ EDTA), (5) pre-adsorbing twice with hGHbp beads then binding the 
non-adsorbed fraction to hPRLbp-beads 

(=-hGHbp, +hFRLbp" selection), or (6) pre-adsorbing tA'ice with hPRLbp-beads then binding the non-adsorbed 
fraction to hGHbp-beads ("-hPRLbp, +hGHbp' selection). The latter two procedures are expected to enrich 
for mutants binding hPRLbp but not hGHbp, or for mutants binding hGHbp but not hPRLbp, respectively. 
45 Binding and eiuiion of phage was carried out in each cycle as follows: 

1. BINDING: An aliquot of hormone phage (typically 10 9 -10™ CFU) was mixed with an equal amount of 
non-hormone phage (pC-AT), diluted into the appropriate buffer (A. B, or C), and mixed with a 10 mL suspension 
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of hGHbp, hPRLbp or blank beads in a total volume of 200mL in a 1 .5 mL polypropylene tube. The phage were 
allowed to bind to the beads by incubating 1 hour at room temperature (23°C) with siow rotation (approximately 
7 RPM). Subsequent steps were carried out with a constant voiume of 200ui and at room temperature. 

2. WASHES: The beads were spun 15 sec, and the supernatant was removed. To reduce the number of 
phage not specifically bound, the beads were washed 5 times by resuspending briefly in the appropriate buffer, 
then pelleting. 

3. hGH ELUTION: Phage binding weakly to the beads were removed by elution with hGH. The beads 
were rotated with the appropriate buffer containing 400 n&hGH for 15-17 hours. The supernatant was saved as 
the "hGH elution" and the beads. The beads were washed by resuspending briefly in buffer and pelleting. 

4. GLYCINE ELUTION: To remove the tightest-binding phage (i.e. those still bound after the hGH 
wash), beads were suspended in Glycine buffer (Buffer A plus 0.2 ^.Glycine, pK 2.0 with HCI), rotated 1 hour and 
pelleted. The supernatant ('Glycine elution"; 200ui) was neutralized by adding 30 mL of 1 M Tris base and 
stored at 4° C. 

5. PROPAGATION: Aiiquots from the hGH elutions and from the Glycine eiutions from each set of 
beads under each set of conditions were used to infect separate cultures of log-phase XL1-Blue cells. 
Transductions were carried out by mixing phage with 1 mL XLI-Blue cells, incubating 20 min. at 37°C S then adding 
K07 (moi= 100). Cultures (25 mL 2YT plus earbeniciiiin) were grown as described above and the next pool of 
phage was prepared as described above. 

Phage binding, elution, and propagation were carried out in successive rounds, according to the cycle 
described above. For example, the phage amplified from the hGH elution from hGHbp-beads were again selected 
on hGHbp-beads and eluted with hGH, then used to infect a new culture of XL1 -Blue cells. Three to five rounds of 
selection and propagation were carried out for each of the selection procedures described above. 

From the hGH and Glycine elution steps of each cycle, an aliquot of phage was used to inoculate XL1-Biue 
cells, which were plated on. LB media containing earbeniciiiin and tetracycline to obtain independent clones from 
each phage pooi. Singie-siranded DNA was prepared from isolated coiony and sequenced in the region of the 
mutagenic cassette. The results of DNA sequencing are summarized in terms of the deduced amino acid sequences 
in Figures 5, 6, 7, and 8. 

To determine the binding affinity of some of the selected hGH mutants for the hGHbp, we transformed 
DNA from sequenced clones into ELM strain 16C9. As described above, this is a non-suppressor strain which 
terminates translation of protein after the final Phe-ISI residue of hGH. Single-stranded DNA was used for 
these transformations, but double-stranded DNA or even whole phage can be easily electroporated into a non- 
suppressor strain for expression of free hormone. 

Mutants of hGH were prepared from osmotically shocked cells by ammonium sulfate precipitation as 
described for hGH (Olson et al., N^iui£293, 408-411 [1981]), and protein concentrations were measured by 
laser densitomoetry of Coomassie-stained SDS-polyacrylamide gel electrophoresis gels, using hGH. as standard 
(Cunningham and Welis, Science 244, 1081-1085 [19893). 
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The binding affinity of each muiant was deiermined by displacement of 125 l hGH as described (Spencer 
et al.. ■). Biol. Cham. 253, 7862-7867 [1 988] ; Fuh et aL, BioL Stem. 265, 31 1 1-31 15 [1990]), using an anti- 
receptor morsodona! antibody (Mab263). 

The results tor a number of hGH muianis, selected by different pathways (Fig. 6) are shown in Table 
5 VII. Many of these mutants have a tighter binding affinity for hGHbp than wild-type hGH. The most improved 
mutant KSYR, has a binding affinity 5.6 fines greater than that of wild-type hGH. The weakest selected 
mutant among those assayed was oniy about iO-foid iower in binding affinity than hGH. 

Binding assays may be carried out for mutants selected for hPRLbp-binding. 

10 TabieVii. 

Competitive binding to hGHbp 

The selected pool in which each mutant was found is indicated as 1G (first glycine selection), 3G (third 
glycine selection), 3H (third hGH selection), 3* (third selection, not binding to hPRLbp, but binding to hQHbp). 
15 The number of times each mutant axnmiamorTgaiiseouermj cores is shown 0- 



20 



40 





Kd (nM) 


Kd(mut)/Kd(hGH) 


Pool 


KSYR (6) 


U.UO + U.U I 


0.18 


1G3G 


RSFR 


0.10 + 0.05 


0.30 


3G 


RAYR 


0.13 + 0.04 


0.37 


3* 


KTYK (2) 


0.16 + 0.04 


0.47 


H,3G 


RSYR (3) 


0.20 + 0.07 


0.58 


1G.3H.3G 


KAYR (3) 


0.22 + 0.03 


0.66 


3G 


RFFR (2) 


0.26 + 0.05 


0.76 


3H 


KQYR 


0.33 + 0.03 


1.0 


3G 


KEFR= wt (9) 


0.34+ 0.05 


1.0 


3H.3G.3* 


RTYH 


0.68 + 0.17 


2.0 


3H 


QRYR 


0.83 + 0.14 


2.5 


3* 


KKYK 


1.1 +0.4 


3.2 


3* 


RSFS (2) 


1.1 +0.2 


3.3 


3G,* 


KSNR 


3.1 +0.4 


9.2 


3* 



At some residues, substitution of a particular amino acid has essentially the same effect independent of 
50 surrounding residues. For example, substitution of F176Y in the background of 172R/174S reduces binding 
affinity by 2.0-fold (RSFR vs. RSYR). Similarly, in the background of 172K/174A the binding affinity of the 
F176Y mutant (KAYR) is 2.9-fo!d weaker than the corresponding 176F mutant (KAFR; Cunningham and Wells, 
1989). 
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On the other hand, the binding constants determined for several selected mutants of hGH demonstrate 
non-additive effects of some amine acid substitutions at residues 172, 174, 176, and 178. For example, in the 
background of 172K/176Y, the substitution E174S results in a mutant (KSYR) which binds hGHbp 3.7-fold 
tighter than the corresponding mutant containing E174A (KAYR). However, in the background of 172R/176Y, 
the effects of these El 74 substitutions are reversed. Here, the E174A mutant (RAYR) binds 1. 5-fold tighter 
than the E174S mutant (RSYR). 

Such non-additive effects on binding for substitutions at proximal residues illustrate the utility of 
protein-phage binding selection as a means of selecting optimized mutants from a library randomized at several 
positions. In the absence of detailed structural information, without such a selection process, many combinations of 
substitutions might be tried before finding the optimum mutant. 

EXAMPLE !X 

SELECTION OF hGH VARIANTS FROM A HELIX-1 RANDOM CASSETTE 
LIBRARY OF HORMONE-PHAGE 

Using the methods described in Example VIII, we targeted another region of hGH involved in binding to 
the hGHbp and/or hPRLbp, helix 1 residues 10, 14, 18, 21, for random mutagenesis in the phGHam-g3p vector 
(also known as pS0643; see Example VIII). 

We chose to use the "amber' hGH-g3 construct (called phGHam-g3p) because it appears to make the 
target protein, hGH, more accessible for binding. This is supported by data from comparative ELISA assays of 
monocionai antibody binding. Phage produced from both pS0132 (S. Bass, R. Greene, J. A. Weiis, Proteins 6, 
309 (1990).) and phGHam-g3 were tested with three antibodies (Medix 2, 1B5.G2, and 5B7.C10) that are known 
to have binding determinants near the carboxyl-terminus of hGH [B. G. Cunningham, P. Jhurani, P. Ng, J. A. Wells, 
Science 243, 1330 (1989); B. C. Cunningham and J. A. Weiis, Science 244, 1081 (1989); L. Jin and J. Weiis, 
unpublished results], and one antibody (Medix 1 ) that recognizes determinants in helices 1 and 3 ([B. C. 
Cunningham, P. Jhurani, P. Ng, J. A. Wells, &fence243, 1330 (1989); B. C. Cunningham and J. A. Wells, Science 
244, 1081 (1989)J). Phagemid particles from phGHam-g3 reacted much more strongly with antibodies Medix 2, 
1B5.G2, and 5B7.C10 than did phagemid particles from pS0132. In particular, binding of pS0132 particles was 
reduced by >2000 fo!d for both Medix 2 and 5B7.C10 and reduced by >25-fo!d for 1B5.G2 compared to binding 
to Medix 1 . On the other hand, binding of phGHam-g3 phage was weaker by only about 1 .5-fold, 1 .2-fold, and 2.3- 
fold for the Medix 2, 1B5.G2, and 5B7.C10 antibodies, respectively, compared with binding to MEDIX 1. 



We mutated residues in helix 1 that were previously identified by alanine-scanning mutagenesis [B. C. 
Cunningham, P. Jhurani, P. Ng, J. A. Wells, Science 243, 1330 (1989); B. C. Cunningham and J. A. Wells, Science 
244, 1081 (1989), 15, 16) to modulate the binding of the extracellular domains of the hGH and/or hPRL 
receptors (called hGHbp and hPRLbp, respectively). Cassette mutagenesis was carried out essentially as 
described [J. A. Wells. M. Vasser. D. B. Powers. Gene 34, 315 (1985)]. This library was constructed by cassette 
mutagenesis that fully mutated four residues at a time (see Example VIII) which utilized a mutated version of 
phGHam-g3 into which unique Kprt (at hGH codon 27)and Xho\ (at hGH codon 6) restriction sites (underlined 
below) had been inserted by mutagenesis [ T. A. Kunkel, J. D. Roberts, R. A. Zakour, Methods EnzymoL 154, 367- 
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382] with the oligonucleotides 5 ! -GCC TTT GAG AGG TAG C AG GAG TTT G-3' and 5'-CCA ACT ATA CCA 
C TC TCG AGG TCT ATT CGA TAA C-3\ respectively. The later oligo also introduced a +1 frameshift 
(italicized) to terminate translation from the starring vector and minimize wild-type background in the phagemid 
library. This strafing vector was designated pHOSOSB. The helix 1 library, which mutated hGH residues 10, 14, 
5 18, 21, was constructed by ligating to the large Xho\-KpiA fragment of pH0508B a cassette made from the 
complementary oligonucleotides S'-dTGG AGG CTC NNS GAC AAC GCG NNS CTG CGT GCT NNS CGT CTT 
NNS CAG CTG GCC TTT GAG ACQ TAC-3* and 5'-pGT GTC AAA GGC CAG CTG SNN AAG ACQ SNN AGC 
ACG CAG SNN CGC GTT GTC SNN GAG CC-3'. The Kpd site was destroyed in the junction of the ligation 
product so that restriction enzyme digestion could be used for analysis of non-mutated background. 

1 o The library contained at least 10 7 independeni transformants so that if the library were absolutely 

random (10 6 different combinations of codons) we would have an average of about 10 copies of each possible 
mutated hGH gene. Restriction analysis using Kpnl indicated that at least 80% of helix 1 library constructs 
contained the inserted cassette. 

Binding enrichments of hGH-phage from the libraries was carried out using hGHbp immobilized on 

1 5 orirane-poiyacTjIamide beads (Sigma Chemfcal Co.) as described (Example VIII). Four residues in helix 1 (F10, 
M14, H18, and H21) were similarly mutated and after 4 and 6 cycles a non-wiid-type consensus developed (Table 
VIII). Position 10 on the hydrophobic face of helix 1 tended to be hydrophobic whereas positions 21 and 18 on the 
hydrcphillic face tended were dominated by Asn; no obvious consensus was evident for position 14 (Table IX). 
The binding constants for these mutants of hGH to hGHbp was deLermined by expressing the free 

20 hormone variants in the non-suppressor £ eo//strain 16C9, purifying the protein, and assaying by competitive 
displacement of labelled wt-hGH from hGHbp (see Example Vlll). As indicated, several mutants bind tighter to 
hGHbp than does wt-hGH. 
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Table VIII. 
Selection of hGH helix 1 mutants 
Variants of hGH (randomly mutated at residues F10, M14, H18, H21) expressed on phagemid 
5 particles were selected by binding to hGHbp-beads and eluting with hGH (0.4 mM) buffer 
followed by glycine (0.2 M, pH 2) buffer (see Example VIII). 



Gly elution 
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H21 
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Table IX 

Consensus sequences from the selected heiixl library 

Observed frequency is fraction of a!! clones sequenced with the indicated amino acid. The nominal frequency is 
calculi ™ the baa's of NNS 32 codon deoeneracy. The maximal enrichment factor varies from 1 1 to 32 
depending upon the nominal frequency value for a given residue. Values of [Kd(Ala mut)/Kd(wthGH)j for single 
alanine mutations were taken from B. C. Cunningham and J. A. Wells, Science 244, 1081 (1989); B. C. Cijnningham 
D. J. Henner, J. A. Wells, Science 247, 1461 (1990); B. C. Cunningham and j. A. Weils, Fm. Nail. Acad. Set USA 
88, 3407 (1991). 

Wild h/De Selected 
Kd(Ala mut) 
K,j(wt hGH) 



observed nominal 





F10 


5.9 


H 


0.50 


0.031 


17 








F 


0.14 


0.031 


5 








A 


0.14 


0.062 


2 




rvii4 


2.2 


G 


0.50 


0.062 


8 


20 






W 


0.14 


0.031 


5 








N 


0.14 


0.031 


5 








S 


0.14 


0.093 


2 




me 


1.6 


N 


0.50 


0.031 


17 


25 






D 


0.14 


0.031 


5 








F 


0.14 


0.031 


5 




H21 


0.33 


N 


0.79 


0.031 


25 








H 


0.07 


0.031 


2 



WO 92/09690 



41 



PCT/US91/09133 



Table X 

Binding of purified hGH helix 1 mutants to hGHhp 

Competition binding experiments were performed using [ 125 l]hGH (wild-type), hGHbp (containing the 
extracellular receptor domain, residues 1 -238), and Mab263 [B. C. Cunningham, P. Jhurani, P. Ng, J. A. Wells, 
Science 243, 1330 (1989)j;. The number P indicates the fracttonai occurrence of each muiani among aii the ciones 
sequenced after one or more rounds of selection. 





Sequence position 




p 


(nJM)\f(K(j mut) 


K^wt hGH)) 




10 


14 


18 


21 








15 


H 


Q 


N 


N 


0.50 


0.14 ± 0.04 


0.42 




A 


W 


D 


N 


0.14 


0.10 ±0.03 


0.30 




Wtm F 


M 


H 


H 


0 


0.34 ± 0,05 


(D 




F 


S 


F 


L 


0.07 


0.6810.19 


2.0 




Y 


T 


V 


N 


0.07 


0.75 ±0.19 


2.2 


20 


L 


N 


s 


H 


0.07 


0.82 ±0.20 


2.4 




I 


N 


1 


N 


0.07 


12 ±031 


3.4 



SELECTION OF hGH VARIANTS FROM A HEUX-4 RANDOM CASSETTE LIBRARY CONTAINING 
PREVIOUSLY FOUND MUTATIONS BY ENRICHMENT OF HORMONE-PHAGE 



Our experience with recruiting non-binding homoiogs ot hGH evolutionary variants suggests thai many 
individual amino acid substitutions can be combined to yield cumulatively improved mutants of hGH with respect to 
binding a particular receptor [B. C. Cunningham, D. J. Henner, J. A. Wells, Science 247, 1461 (1990); B. C. 
Cunningham and J. A. Weils, Free. Man. Acad. Sd. USA 88, 3407 (1991); H. B. Lowman, B. C. Cunningham, J. A. 
Wells, J. Biol. Chem. 266, in press (1991 )]. 

The helix 4b library was constructed in an attempt to further improve the helix 4 double mutant 
(E174S/F176Y) seiected from the heiix 4a library ihat we found bound tighter to the hGH receptor (see 
Example VIII). With the E174S/F176Y hGH mutant as the background starting hormone, residues were mutated 
that surrounded positions 174and 176 on the hydrophilic face of helix 4 (R167, D171, T175 and 1179) . 
Construction of the heiix 4b ilbrarv bv cassette mutagenesis 

Cassette mutagenesis was carried out essentially as described [J. A. Wells, M. Vasser, D. B. Powers, 
Gen§&, 315 (1885)]. The helix 4b library, which mutated residues 157, 171, 175 and 179 within the 
E174S/F176Y background, was constructed using cassette mutagenesis that fully mutated four residues at a 
time (see Example VIII) and which utilized a mutated version of phGHam-g3 into which unique Bsffll and Bgl\\ 
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restriction sites had been inserted previously (Example viii). into the BsiB\-8gni sites of the vector was inserted 
a cassette made from the complementary oligonucleotides 5*-pG TTA CTC TAC TGC TTC NNS AAG GAG ATG 
NNS AAG GTC AGO NNS TAC CTG CGC NNS GTG GAG TGC A-3" and 5'-pGA TGT GCA CTG CAC SNN 
GCG CAG GTA SNN GCT GAC CTT SNN CAT GTC CTT SNN GAA GCA GTA GA-3 1 . The SsO site was 
5 eliminated in the ligated cassette. From the helix 4b library, 15 unsetected clones were sequenced. Of these, none 
lacked a cassette insert, 20% were frame-shifted, and 7% had a non-silent mutation. 

Binding enrichments of hGH-phage from the libraries was carried out using hGHbp immobilized on 
oxirane-polyacr/amide beads (Sigma Chemical Co.) as described (Example VIII). After 6 cycles of binding a 
1 0 reasonably clear consensus developed (Table XI). inieresBngiy. ail positions tended to contain polar residues, 
notably Ser.Thr and Asn (XII). 



The binding constants for some of these muianis of hGH to hGHbp was determined by expressing the 
free hormone variants in the non-suppressor £ coli strain 16C9, purifying the protein, and assaying by competitive 
displacement of labelled wt-hGH from hGHbp (see Example VIII). .As indicated, the binding affinities of several 
helix-4b mutants for hGHbp were tighter than that of wt-hGH Tabie Xiii). 

Finally, we have begun to analyze the binding affinity of several of the tighter hGHbp binding mutants 
for their ability to bind to the hPRLJbp. The E174S/F176Y muiant binds 200-foid weaker to the nFRLbp than 
hGH. The E174T/F176Y/R178K and R167N/D171S/E174S/F176Y/I179T mutants each bind >500-fold weaker 
to the hPRLfap than hGH. Thus, it is possible to use the produce new receptor selective mutants of hGH by phage 
display technology. 

Hnnnone-Dhaoemlri selection Identifies the ItTfomtattofKnntPnrof narticular residues 

Of the 12 residues mutated in three hGH-phagemid libraries (Examples VIII, IX, X), 4 showed a strong, 
although not exclusive, conservation of the wild-type residues (K1 72, Ti 75, F1 76, and R1 78). Not surprisingly, 
these were residues that when converted to Ala caused the largest disruptions (4- to 60-fold) in binding affinity 
ioihe hGHbp. There wasaciass of 4 other residues (F10, M14, D171, and 1179) where Ala substitutions caused 
weaker effects on binding (2- to 7-fold) and these positions exhibited little wiid-type consensus. Finaiiy the other 
4 residues (H18, H21, R167, and E174), that promote binding to he hPRLbp but not the hGHbp, did not exhibit 
any consensus for the wiid-type hGH sequence by selection on hGHbp-beads. In fact two residues (E174 and 
H21), where Ala substitutions enhance binding affinity to the hGHbp by 2- to 4-fold [B. C. Cunningham, P. 
Jhurani, P. Ng, -J. A. Wells, Science 243, 1330 (1989): B. C. Cunningham and J. A. Wells, Science 244, 1081 
(1989); B. C. Cunningham, D. J. Henner, J. A. Weils, Science 247, 1481 (1SS0); B. C. Cunningham and J. A. Wells, 
Proc. Nat Acad. Sd. USA 88, 3407 (1991)1. Thus, the alanine-scanning mutagenesis data correlates reasonably 
ws!! with the flexibility to substitute each position. In fact , the reduction in binding affinity caused by alanine 
substitutions [B.C. Cunningham, P. Jhurani, P. Ng, J. A. Wells, Sconce 243, 1330(1989); B. C. Cunningham and 
J. A. Wells, SBfem»244, 1081 (1989)], B. C. Cunningham, D. J. Henner, J. A. Wells, Science 247, 1461 (1990); B. 
C. Cunningham and J. A. Wells, Proc. Nat Acad. Sd. USA 88, 3407 (1991)! is a reasonable predictor of the 
percentage that the wild-type residue is found in the phagemid pool after 3-6 rounds of selection. The alanine- 
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scanning information is useful for targeting side-chains that modulate binding, and the phage selection is 
appropriate for optimizing them and defining the flexibility cf each site {and/or combinations of sites) for 
substitution. The combination of scanning mutational methods [B. C. Cunningham, P. Jhurani, P. Ng, j. A. Wells, 
Science 243, 1330 (1989); B. C. Cunningham and J. A. Wells, Science 244, 1081 (1989)] and phage display is a 
5 powerful approach to designing receptoNigand interfaces and studying molecular evolution in vitro. 

In cases where combined mutations in hGH have additive effects on binding affinity to receptor, 
mutations learned through hormone-phagemid enrichment to improve binding can be combined by simple cutting 
and ligation of restriction fragments or mutagenesis to yield cumulatively optimized mutants of hGH. 

10 On the other hand, mutations in one region of hGH which optimize receptor binding may be structurally or 

functionally incompatible with mutations in an overlapping or another region of the molecule. In these cases, 
hormone phagemid enrichment can be carried out by one of several variations on the iterative enrichment approach : 
(1 ) random DNA libraries can be generated in each of two (or perhaps more) regions of the molecule by cassette 
or another mutagenesis method. Thereafter, a combined library can be created by ligation of restriction 

1 5 fragments from the two DNA libraries; (2) an hGH variant, optimized for binding by mutation in one region of the 
molecule, can be randomly mutated in a second region of the molecule as in foe helix-4b library example; (3) two or 
more random iibraries can be pariiaiiy selected for improved binding by hormone-phagemid enrichmeni; after this 
"roughing-in" of the optimized binding site, the still-partially-diverse iibraries can be recombined by ligation of 
restriction fragments to generate a single library, partially diverse in two or more regions of the molecules, which 

20 in turn can be further selected for optimized binding using hormone-phagemid enrichment. 
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Table XL 

Mutant phagemlds of hGH selected from helix 4b library after4and6<vciesofenrichnienL 
Selection of hGH helix 4b mutants (randomly mutated at residues 167, 171, 175, 179), each containing the E174S/F176Y 
double mutant, by binding to hGHbp-beads and eluting with hGH (0.4 m.W) buffer followed by glycine [0.2 M, pH 2) buffer. 
5 One mutant (+) contained the spurious mutation RtTBH. 
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Tabie Xii 

Consensus sequences fron the selected library. 
Observed frequency is fraction of all clones sequenced with the indicated amino add. The nominal frequency 
is calculated on the basis of NNS 32 codon degeneracy. The maximal enrichment factor varies from 1 1 to 16 to 32 
5 depending upon the nominal frequency value for a given residue. Values of [Kjj{A!a mut)/Kjj(wt hGH)] for single 
alanine mutations were taken from refs. below; for position 175 we only have a value for the T175S mutant [B. C. 
Cunningham, P. Jhurani, P. Ng, J. A. Wells, Science 243. 1330 (1989); B. C. Cunningham and J. A. Wells, Science 244, 
1081 (198S); B. C. Cunringham, D. J. tenner, J. A. Wells, Science 247, 1461 (19S0); B. C. Cunningham and J. A. Wells, 
Proc. Nat Acad. Sci. USA 88, 3407 (1991).]. 
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25 

Table XIII 

Binding of purified hGH mutants to hGHbp. 
Competition binding experiments were performed using | 125 ijhGH (wild-type), hGHbp (containing the 
3 0 extracellular receptor domain, residues 1 -238), and Mab263 (11). The number P indicates the fractional 

occurrence of each mutant among all the clones sequenced after one or more rounds of selection. Note that the 
helix 4b mutations (*) are in the background of hGH(E174S/F176Y). In the list of helix 4b mutants,, the 
E174S/F176Y mutant (*), with wt residues at 167, 171 , 175, 179, is shown in bold. 



Sequence portion P frfr ^lffhGH) 



N S T T 0.18 0.04 ± 0.02 0.12 

E S T i 0.06 0.04 ± 0.02 0.12 

K S T L 0.06 0.05 ± 0.03 0.16 

N N T T 0.06 0.06 ±0.03 0.17 

R D T I 0 0.06 ±0.01 (0.18) 

N S T Q 0.06 0.26 ±0.11 0.77" 
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Bf AMPLE Xi 
Assembly of Molecule on the Phagemki Surface 

Piasmid pDH 188 contains the DNA encoding the Fab portion of a humanized !gG antibody, called 4D5, 
5 that recognizes the HER-2 receptor. This piasmid is contained in £ constrain SR 101, and has been deposited 
with the ATCC in Rcckville, MD. 

Brieny, the piasmid was prepared as ioiiows: the starting piasmid was pS0132, containing the alkaline 
phosphatase promoter as described above. The DNA encoding human growth hormone was excised and, after a 
series of manipulations to make the ends of the piasmid compatible for ligation, the DNA encoding 4D5 was 
1 0 inserted. The 4D5 DNA contains two genes. The first gene encodes the variable and constant regions of the light 
chain, and contains at its 5' end the DNA encoding the st II signal sequence. The second gene contains four 
portions: first, at its 5' end is the DNA encoding the st II signal sequence. This is followed by the DNA encoding 
me variable domain of the heavy chain, which is foiiowed by the DNA encoding the first domain of the heavy chain 
constant region, which in turn is followed by the DNA encoding the M13 gene III. The salient features of this 
15 construct a-esho-A-n in Figure 10. Tte seance of the IMA e»^^ 

Both polyethylene glycol (PEG) and electroporafion were used to transform plasmids into SR101 cells. 
(PEG competent cells were prepared and transformed according to the method of Ghung and Miller (Nucleic 
Acids Res. 16:3580 [1988]). Cells that were competent for eiectroporaiion were prepared, and subsequenily 

20 transformed via electroporatJon according to the method of Zabarovsky and Winberg (Nucleic Adds Res. 18:5912 
[1930j). After placing the cells in 1 ml of the SOC media (described in Sambrook etaL supraV they were grown 
for 1 hour at 37°C with shaking. At this time, the concentration of the ceils was cJeierrnined using light scattering 
at ODboo- A titered K07 phage stock was added to achieve an multiplicity of infection (MOI) of 100, and the 
phage were allowed to adhere to the cells for 20 minutes at room temperature. This mixture was then diluted into 

25 25 mis of 2YT broth (described in Sambrook ef a/., supra) and incubated with shaking at 37°C ovemighi. The 
next day, cells were pelleted by centrifugation at 5000 x g for 10 minutes, the supernatant was collected, and the 
phage part'eies were precipitated with 0.5 M NaCI and 4% PEG (final concentration) at room temperature for 10 
minutes. Phage particles were pelleted by centrifugation at 10,000 x g for 10 minutes, resuspended in 1 mi of TEN 
(10 mM Tris, pH 7.6, 1 mM EDTA, and 150 mM NaCI), and stored at 4°C. 

Aliquoteof 0.5 ml from a solution of 0.1 mg/ml of the extra-cellular domain of the HER-2 antigen (ECD) 
ora solution of 0.5 mg/ml of BSA (control antigen) in 0.1 M sodium bicarbonate. pH 8.5 were used to coat one well 
of a Faicon 1 2 weii tissue culture plate. Once She solution was applied to the walls, the plates were incubated at 
4°C on a rocking platform overnight. The plates were then blocked by removing the initial solution, applying 0.5 ml 
35 of blocking buffer (30 mg/ml BSA in 0.1 M sodium bicarbonate), and incubating at room temperature for one hour. 
Finally, the blocking buffer was removed, 1 mi of buffer A (PBS, 0.5% BSA, and 0.05% Tween-20) was added, 
and the plates were stored up to 10 days at 4°C before being used for phage selection. 
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Approximately 10 9 phage particles were mixed with a 100 fold excess of K07 helper phage and 1 ml of 
buffer A . This mixture was divided into two 05 ml aliquots; one of which was applied to ECD coated wells, and 
the other was applied to BSA coated wells. The plates were incubated at room temperature while shaking for one 
5 to three hours, and were then washed three times over a period of 30 minutes with 1 m! aliquots of buffer A. 
Elution of the phage from the plates was done at room temperature by one of two methods: 1 ) an initial overnight 
incubation of 0.025 mg/ml purified Mu4D5 antibody (murine) followed by a 30 minute incubation with 0.4 ml of 
the acid eiuiion buffer (0.2 M glycine, pH 2.1, 0.5% BSA, and 0.05% Tween-20), or 2) an incubation with the acid 
elution buffer alone. Eluates were then neutralized with 1 M Tris base, and a 05 ml aliquot of TEN was added. 
1 0 These samples were then propagated, titered, and stored at 4°C. 
Phage propagation 

Aliquots of eluted phage were added to 0.4 ml of 2YT broth and mixed with approximately 10 s mid-log 
phase male £ coii strain SR101 . Phage were allowed to adhere to the cells for 20 minutes at room temperature 
and ihen added to 5 mi of 2YT broth thai contained 50 jig/mi of earbeniciilin and 5 My/ml of tetracycline. These 
1 5 cells were grown at 37°C for 4 to 8 hours until they reached mid-log phase. The OD600 was determined, and the 
cells were superirtfected with K07 helper phage for phage production. Once phage particles were obtained, they 
were (Hered in order io determine the number of colony forming units (cfu). This was done by taking aliquots of 
serial dilutions of a given phage stock, allowing them to infect mid-log phase SR1 01 , and plating on LB plates 
containing 50 vg!m\ earbeniciilin. 

The affinity of h4D5 F a b fragments and F a b phage for the ECD antigen was determined using a 
competitive receptor binding RIA (Burt, D. R., Receptor Binding in Drug Research, O'Brien, R.A. (Ed.) pp. 3- 
29, Dekker, New York [1 986]). The ECD antigen was labeled wiih 1 25 -k>dine using ihe sequential chioramine-T 
method (De Larco, J. E. etal, J. Cell. Physiol. 109:143-152 [1981]) which produced a radioactive tracer with a 

25 specific activity of 14u.GL/iig and incorporation of 0.47 moles of Iodine per mole of receptor. A series of 0.2 ml 
solutions containing 0.5 ng (by ELiSA) of F a b or Fab phage, 50 nCi of 125 i ECD tracer, and a range of unlabeled 
ECD amounts (6.4 ng to 3277ng) were prepared and incubated at room temperature overnight. The labeled 
ECD-Fab or EGD-Fab phage complex was separated from the unbound labeled antigen by forming an aggregate 
complex induced by the addition of an anti-human igG (Fitzgerald 40-GH23) and 6% PEG 8000. The complex was 

3 0 pelleted by centrifugation (1 5,000 x g for 20 minutes) and the amount of labeled ECD (in com) was determined 
by a gamma counter. The dissociation constant (Kd) was calculated by employing a modified version of the 
program LIGAND (Munson, P. and Rothbard, D., Ana/. Biochem. 107220-239 [1980]) which utilizes Scatchard 
analysis (Scatchard, G.,Ar»rj. N.Y. Acad. Sd. 51:660-672 [1949]). The Kd values are shown in Figure 13. 

35 Murine 4D5 antibody was labeled with 125-1 to a specific activity of 40-50 u,Ci/ng using the iodogen 

procedure. Solutions containing a constant amount of labeled antibody and increasing amounts of unlabeled variant 
Fab were prepared and added to near confluent cultures of SK-BR-3 cells grown in 36-wel! microtiter dishes 
(final concentration of labeled antibody was 0.1 nM). After an overnight incubation at 4°C, the supernatant was 
removed, the cells were washed and the cell associated radioactivity was determined in a gamma counter. Kd 
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values were determined by analyzing the data using a modified version of the program LIGAND (Munson, P. and 
Rothbard, D., supra) 

This deposit of plasmid pDHISfi ATCC no. 68663 was made under the provisions of the Budapest 
Treaty on the Intemationai Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure 
and the Regulations thereunder (Budapest Treaty). This assures maintenance of a viable culture for 30 years 
from the date of deposit. The organisms will be made available by ATCC under the terms of the Budapest 
Treaty, and subject to an agreement between Genenteeh, inc. and ATCC, which assures permanent and 
unrestricted availability of the progeny of the cultures to the public upon issuance of the pertinent U.S. patent or 
upon Saying open to the public of any U. S. or foreign patent application, whichever comes first, and assures 
availability of the progeny to one determined by the U. S. Commissioner of Patents and Trademarks to be entitled 
thereto according to 35 USC §122 and the Commissioner's rules pursuant thereto (including 37 CFR §1.14 with 
particular reference to 886 OG 638). 

The assignee of the present application has agreed that if the cultures on deposit should die or be lost 
or destroyed when cultivated under suitable conditions, they will be promptly replaced on notification with a viable 
specimen of the same culture. Availability of the deposited cultures is not to be construed as a license to practice 
the invention in contravention of the rights granted under the authority of any government in accordance with its 
patent laws. 

The foregoing written specification is considered to be sufficient to enable one skilled in the art to 
practice the invention. The present invention is not to be limited in scope by the cultures deposited, since the 
deposited embodiments are intended as separate illustrations of certain aspects of the invention and any cultures 
that are functionally equivalent are within the scope of this invention. The deposit of material herein does not 
constitute an admission that the written description herein contained is inadequate to enable the practice of any 
aspect of the invention, including the best mode thereof, nor is it to be construed as limiting the scope of the 
claims to the specific illustrations that it represents. Indeed, various modifications of the invention in addition to 
those shown and described herein will become apparent to those skilled in the art from the foregoing description 
and fall within the scope of the appended claims. 

While the Invention has necessarily been described in conjunction with preferred embodiments, one of 
ordinary skill, after reading the foregoing specification, will be able to effect various changes, substitutions of 
equivalents, and alterations to the subject matter set forth herein, without departing from the spirit and scope 
thereof. Hence, the invention can be practiced in ways other than those specifically described herein. It is 
therefore intended that the protection granted by Letters Patent hereon be limited only by the appended claims 
and equivalents thereof. 
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EXAMPLE Xii 

SELECTION OF hGK VARIANTS FROM COMBINATIONS OF HELIX-1 AND HEUX-4 HORMONE-PHAGE 
VARIANTS 

5 ConstPjetion of addiilvs variants of hGH 

According to additiviiy principles [J. A. weiis, Biochemistry 29, 8509 (1990)}, mutations in different 
parts of a protein, if they are not mutually interacting, are expected to combine to produce additive changes in the 
free energy of binding to another molecule (changes are additive in terms of MG^ing. or multiplicative in 
terms of = expf-AG/RTj ). Thus a mutation producing a 2-roid increase in binding affinity, when combined 

1 0 with a second mutation causing a 3-fold increase, would be predicted to yield a double mutant with a 6-fold 
increased affinity ever the starting variant. 

To test whether multiple mutations obtained from hGH-phage selections would produce cumulatively 
favorable effects on hGHbp (hGH-binding protein; the extracellular domain of the hGH receptor) binding, we 
combined mutations found in the three tightest-binding variants of hGH from the helix-1 library {Example IX: 

1 5 F10A/M14W/H18D/H21N, F10H/M14G/H18N/H21N, and F10F/M14S/H18F/H21L) with those found in the 
three tightest binding variants found in the helix-4b library (Example X: R 1 67N/D1 71 S/T1 75/11 79T, 
R1S7E/D171S/T175/I17S, and R167N/D171 N/T175/S179T). 

hGH-phagemid double-stranded DNA (dsDNA) from each of the one-helix variants was isolated and 
digested with the restriction enzymes Ecoffl and fls/XI. The large fragment from each helix-4b variant was then 

20 isolated and ligated with the small fragment train each helix-1 variant to yield the new two-helix variants shown in 
Table XIII. All of these variants also contained the mutations E174S/F176Y obtained in earlier hGH-phage binding 
selections (see Example X for details). 
Cygfijctftn, of geiectfte combinatorial libraries of hqK 

Although additivity principles appear to hold for a number of combinations of mutations, some 

25 combinations (e.g. E174S with F176Y) are clearly non-additive (see examples VIII and X). In order to identify 

with certainty the Bghtest winding variant with, for example, 4 mutations in helix-1 and. 4 mutations in helix-4, one 
would ideally mutate all 8 residues at once and then sort the pool for the globally tightest binding variant. 
However, such a pool would consist of 1.1 x 10 12 DNA sequences (utilizing NNScodon degeneracy) encoding 2.6 x 
10 10 different polypeptides. Obtaining a random phagernid library large enough to assure representation of all 

30 variants (perhaps 10 13 transformants) is not practical using current transformation technology. 

We have addressed this difficulty first by utilizing successive rounds of mutagenesis, taking the 
tightest binding variant from one library, men mutating other residues to further improve binding (Example X). 
In a second method, we have utilized the principle of additivity to combine the best mutations from two 
independently sorted libraries to create multiple mutants with improved binding (described above). Here, we 

35 further searched mrough ihe possibie combinations of mutations at positions 10, 14, 18,21, 167, 171, 175,and 
179 in hGH, by creating combinatorial libraries of random or partially-random mutants. We constructed three 
different combinatorial libraries of hGH-phagemids, using the pooled phagemlds from the helix 1 library 
(independently sorted for 0, 2, or 4 cycies; Example ix) and the pool from the heiix-4b library (independently 
sorted for 0, 2, or 4 cycles; Example X) and sorted the combined variant pool for hGHbp binding. Since some 

4 Q amount of sequence diversity exists in each of these pools, the resulting combinatorial library can explore more 
sequence combinations than what we might construct manually (e.g. Table XIII). 
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hGH-pnagemki doubie-siranded DNA (dsDNA) from each of the one-helix library pools (selected for 0, 
2, or4 rounds) was isolated and digested with the restriction enzymes AaA and flsttl. The large fragment from 
each he!ix-l variant poo! was then isolated and ligated with the small fragment from each helbc-4b variant pool to 
yield the three combinaioriai libraries pH0707A (unseiected helix 1 and helix 4b pools, as described in examples IX 
5 and X), pH0707B (twice-selected helix-1 pool with twice-selected helix-4b pool), and pH0707C (4-Hmes 
selected helix-1 poo! with 4-Smes selected he!ix-4b pool). Duplicate ligations were also set up with less DNA and 
designated as pHo707D, pH0707E, and pH0707F, eonesponding to the 0-.2-, and 4=round starting libraries 
respectively. All of these variant pools also contained the mutations E174S/F176Y obtained in earlier hGH-phage 
binding selections (see Example X for deans). 

The ligation products pH0707A-F were processed and electro-transformed into XLI-Blue cells as 
described (Example VIII). Based on colony-forming units (CFUV the number of transformants obtained from each 
pool was as follows: 2.4x10 s from pH0707A, 1.8x108 from pH0707B, 1.6x10^ from pH0707C, 8x10 s from 
DH0707D, 3x1 0 5 from pH0707E, and 4x10 5 from pH0707F. hGH-phagemid particles were prepared and 

1 5 selected for hGHbp-binding over 2 to 7 cydes as described in Example VIII. 
papM Mrfna of hGH-Dhanwnld libraries 

In addition to sorting phagemid libraries for tight-binding protein variants, as measured by equilibrium 
binding affinity, it is of interest to sort for variants which are altered in either the on-rate (kon) or the off-rate 
(koff) of binding to a receptor or other moiecuie. From thermodynamics, these rates are related to the 

20 equilibrium dissodation constant, Kd - (koff/kon). We envision that certain variants of a particular protein have 
similar K^fs for binding white having very different k^'s and koffs. Conversely, changes in Kd from one variant to 
another may be due to effects on kon, effects on koff, or both. The pharmacological properties of a protein may 
be dependent on binding affinity or on kon or koff, departing on the detailed mechanism of action. Here, we sought 
to identify hGH variants with higher on-rates to investigate the effects of changes in ko n . We envision that the 

25 selection could alternatively be weighted toward koff by increasing the binding rime and increasing the wash time 
and/or concentration with cognate ligand (hGH). 

From tjme-course analysis of wild-type hGH-phagemid binding to immobilized hQHbp, it appears that, 
of the total hGH-phagemid particles that can be eluted in the final pri 2 wash (see Example Viii for the complete 
binding and elution protocol), less than 10% are bound after 1 minute of incubation, while greater than 90% are 

30 bound after 1 5 minutes of incubation. 

For "rapid-binding selection; phagemid particles from the pH0707B pool (twice-selected for helices 1 
and 4 independenfjy) were incubated with immobilized hGHbp for only 1 minute, then washed six times with 1 mL of 
binding buffer; the hGH-wash step was omitted; and the remaining hGH-phagemid partides were eluted with a 
pH2 (0.2M glydne in binding buffer) wash. Enrichment of hGH-phagemid partides over non-displaying partides 

35 indicated that even with a short binding period and nocognate-ligand (hGH) challenge, hGH-phagemid binding 
selection sorts tight-binding variants out of a randomized pool. 
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The binding constants for some of these mutants of hGH to hGHbp was determined b] 
free hormone variants in the non-suppressor £ coli strain 16C9 or 34B8, purifying the protein, and assaying by 
competitive displacement of labelled wt-hGH from hGHbp (see Example VIII) in a radio-immunoprecipitation assay. 
In Table XII! -A below, all the variants have g!utamatei74 replaced by serinei 74 and pheny!a!aninei76 replaced by 
tyrosinei76 (E174S and F1 176Y) plus the additional substitutions as indicated at hGH amino acid positions 10, 
14, 18, 21, 167, 171, 175 and 179. 



Table Xlll-A 

hGH variants front addition of heJix-1 and heh4b mutations 
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25 In Table XIV below, hGH variants were selected from combinatorial libraries by the phagemid binding 

selection process. All hGH variants in Table XIV contain two background mutations (E174S/F176Y). hGH- 
phagemid pools from the libraries pH0707A (Part A), pH0707B and pH0707E (Part B), or pH0707C (Part C) 
were sorted for 2 to 7 cycles for binding to hGHbp. The number £ indicates the fractional occurrence of each 
variant type among the set of clones sequenced from each pool. 
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Table XIV 
hGHvariantefrtmhon«>nei>fiag^^ 



wild-type residue: 

_P Variant 

Part A: 4 cycles: 

0.60 H0714A.1 

0.40 H0714A.4 
Part B: 

2 cycles: 

0.13 H0712B.1 

0.13 H0712B.2 

0.13 H0712B.4 

0.13 H0712B.5 

n.13 H0712B.6 

0.13 H0712B.7 

0.13 H0712B.8 

0.13 H0712E.1 

0.13 M0712E.2 

0.13 H0712E.3 

0.13 H0712E.4 

0.13 H0712E.5 

0.13 H0712E.6 

0.13 H0712E.7 

0.13 H0712E.S 
4 cycles: 

0.17 H0714B.1 

0.17 H0714B.2 

0.17 H0714B.3 

0.17 HC714B.4 

Q.17 H0714B.5 

0.17 H0714B.6 
7 cycles: 

0.57 H0717B.1 

0.14 H0717B.2 

0.14 H0717B.6 

0.14 H0717B.7 

0.50 H0717E.1 
Part C: 

4 cycles: 

0.67 H0714C.2 



45 * = also contained the mutations L15R, K168R. 
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In Table XV below, hGH variants were selected from combinaioriai libraries by the phagernid binding 
selection process. All hGH variants in Table XV contain two background mutat'ons (E174S/F176Y). The number 
50 £ is the fractional occurrence of a given variant among a!! clones sequenced after 4 cycles of rapid-binding 
selection. 
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Table XV 

hQH variants from RAPID hGHbp binding selection of an hQH-phagemid combinatorial library 



wiid-iype 
_£ 


residue: 
Variant 


m 


M14 


HIS 


H21 


R167 


D171 


7175 


1173 


0.14 


H07BF4.2 


W 


G 


S 


S 


R 


D 


T 


1 


0.57 


H07BF4.3 


M 


A 


u 


N 


N 


S 


T 


T 


0.14 


H07BF4.6 


A 


W 


D 


N 


S 


S 


V 


T* 


0.14 


H07BF4.7 


H 


Q 


T 


S 


R 


D 


T 


1 



+ = also contained the mutation Y176F (wild-type hQH a!so contains F176). 



!n table XV! below, binding constants were measured by competitive displacement ot 1 25 l-labelled 
hormone H0650BD or iabeiied hGH using hGHbp (1-238) and either Mab5 or Mab263. The variant H0B50BD 
appears bind more than 30-fold tighter than wild-type hGH. 
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Table xvi 



EqtiUbrium binding constants of seieeied hGK variants. 



5 


hGH 
Variant 


Kdh/ariantt 
Kd(H0650BD) 


Kdiyarianll 
Kd(hGH) 


Kd (pM) 




hGH 


32 


-1- 


340 ± 50 


10 


H0650BD 


-1- 


0.031 


10±3 




H0650BF 


1.5 


0.045 


15±5 




H0714B.6 


3.4 


0.0SS 


34 i 19 


15 


H0712B.7 


7.4 


0.22 


74 ± 30 




H0712E.2 


16 


0.48 


60 ±70 



EXAMPLE XIII 



Selective enrichment of hGrHxiage containing a proiease substrate sequence versus rawsubsirate phage 

25 As described in Example !, the pJasmid PS0132 contains the gene for hGH iused to the residue Pro198 of the gene 
III protein with the insertion of an extra glycine residue. This plasmid may be used to produce h.GH-phage parties 
in which the hGH-gene III fusion productis displayed monovalent!/ on the phage surface (Example IV). The fusion 
protein comprises the entire hGH protein fused to the carboxy terminal domain of gene III via a flexible linker 
sequence. 

30 To investigate the feasibility of using phage display technology to select favourable substrate 

sequences for a given proteolytic enzyme, a genetically engineered variant of subtilisin BPN' was used. (Carter, P. 
et al. Proteins: Structure, function and genetics 6:240-248 (1989)). This variant (hereafter referred to as 
A64SAL subtilisin) contains the following mutations: Ser24Cys, His64Ala, Glu156Ser, Gly169Ala and 
Tyr217Leu. Since this enzyme lacks the essentia! catalytic residue His64, its substrate specificity is greatly 

35 restricted so that certain histidine-coniaining substrates are preferentially hyrdroiysed (Carter et a!., Science 
237:394-399 (1987)). 
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Construction of a hGH-abstrate-phaoe veflsL. 

The sequence of the linker region in pS0132 was mutated to create a substrate sequence for A64SAL 
subtilisin, using the oligonucleotide S'-TTC-GGG-CCC-TTC-GCT-GCT-CAC-TAT-ACG-CGT-CAG-TCG- 
ACT-GAC-CTG-CCT-3'. This resulted in the introduction of the protein sequence Phe-Gly-Pro-Phe-Ala-Ala- 
5 His-Tyr-Thr-Arg-Gin-Ser-Thr-Asp in the linker region between hGH and the carboxy terminal domain of gene ill, 
where the first Phe residue in the above sequence is Phe19l of hGH. The sequence Ala-Ala-His-Tyr-Thr-Agr-GIn 
is known to be a good substrate for A64SAL subtilisin (Carter et al (1989), supra). The resulting plasmid was 
designated pS0640. 

Phagemid particies derived from pS0132 and pS0640 were constructed as described 
in Example I. In initial experiments, a 10uJ aliquot of each phage pool was separately mixed with 
30uJ of oxirane beads {prepared as described in Example II) in IOOuJ of buffer comprising 
20mfvi Tris-HCi pH 8.6 and 2.5M NaCi. The binding and washing steps were performed as 

1 5 described in example VII. The beads were then resuspended in 400u.l of the same buffer, with 
or without SQn.M of A64SAL subtilisin. Following incubation for 10 minutes, the supernatants 
were collected and the phage titres (cfu) measured. Table XVII shows that approximately 10 
times more substrate-containing phagemid particles (pS0640) were eluted in the presence of 
enzyme than in the absence of enzyme, or than in the case of the non-substrate phagemids 

20 (pS0l32) in the presence or absence of enzyme, increasing the enzyme, phagemid or bead 
concentrations did not improve this ratio. 

In an attempt to decrease the non-specific elution of immobilised phagemids, a tight- 
25 binding variant of hGH was introduced in place of the wild-type hGH gene in pSOl 32 and 

pS0640. The hGH variant used was as described in example XI (pH0650bd) and contains the 
mutations PhelOAIa, MetUTrp, His18Asp, His21Asn, Arg167Asn, Asp171Ser, Glu174Ser, 
Phe176Tyr and !!e179Thr. This resulted in the construction of two new phagemids: pDM0380 
(containing tight-binding hGH and no substrate sequence) and pDM041 1 (containing tight- 
30 binding hGH and the substrate sequence Ala-Ala-His-Tyr-Thr-Agr-GIn). The binding washing 
and elution protocol was also changed as follows: 

(i) Binding: COSTAR 1 2-well tissue culture plates were coated for 1 6 hours with 
0.5ml/well 2ug/ml hGHbp in sodium carbonate buffer pH 10.0. The plates were then incubated 
with 1 ml/well of blocking buffer (phosphate buffered saline (PBS) containing 0.1%w/v bovine 
35 serum albumen) for 2 hours and washed in an assay buffer containing 1 0mM Tris-HCI pH 7.5, 
imM EDTA and lOOmM NaCl. Phagemids were again prepared as described in Example I: the 
phage pool was diluted 1 :4 in the above assay buffer and 0.5mi of phage incubated per weii for 
2 hours. 
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(ii) Washing; The piates were washed thoroughly with PBS + 0.05% Tween 20 
and incubated for 30 minuted with 1 ml of this wash buffer. Tfiis washing step was repeated 
three times. 

(ii) Eution: The piates were incubated for i 0 minutes in an eiution buffer 
5 consisting of 20mM Tris-HCI pH 8 6 + 1 0OmM NaCI, then the phage were eluted with 0.5mi of 
the above buffer with or without 500nM of A64SAL subtilisin. 

Table XVII shows that there was a dramatic increase in the ratio of specifically eluted 
substrate-phagemid particles compared to the method previously described for pS0640 and 
pS0132. It is likely that this is due to the fact that the tight-binding hGH mutant has a 
1 0 significantly slower off-rate for binding to hGH binding protein compared to wild-type hGH. 
Table XVI! 

Specific eiution of substrate-phagemlds by A64SAL subtilisin 

1 5 Colony forming units (cfu) were estimated by plating out 1 0ul of f 0-fo!d dilutions of phage on 
lOui spots of XL-1 blue cells, on LB agar plates containing 50ug/m! carbeniallm! 

(i) Wild-type hGH gene: binding to hGHbp-oxirane beads 

+ 50nM AS4SAL no en z yme 



PS0640 (substrate) 3x10%u/10ul 1.5x1p6cfu/10nl 

pS0132 (non-subsiraie) 6x10 5 cfu/10u! 3x10 5 cfu/10-i! 

25 

(ii) pH0650bd mutant hGH gene: binding to hGHbp-coated plates 

phagemid t SQnM A54SAL ng enzyme 

30 

PDM0411 (substrate) 1.7x10 5 CfU/10ul 2x103 G fu/10n! 

PDM0390 (non-substrate) 2x10 3 cfu/10pJ 1x103cfu/10uJ 

35 Example XIV 

Identification of preferred substrates for A64SAL subtilisin using selective 
enrichment of a library of substrate sequences. 

40 We sought to employ the selective enrichment procedure described in Example XII! to 

identify good substrate sequences from a library of random substrate sequences. 

Hnngtmotion of a y prtnrfnr insertion of randomised substrate CBSSettCS 

We designed a vector suitable for introduction of randomised substrate cassettes, and 
45 subsequent expression of a library of substrate sequences. The starting point was the vector 
pS0643, described in Example Vlli. Site-directed mutagenesis was carried out using the 
oligonucleotide 5'-AGC-TGT-GGC-TTC-GIa&CI^ 

TCT-3', which introduces 6S3l (GGGCGC) and SaJl (GTCGAC) restriction sites between hGH 
and Gene III. This new construct was designated pDM0253 (The actual sequence of pDM0253 
50 is 5^AGC-TGT-GGC-TTC-GGG-CCC-GCC-C^^ where 
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the underlined base substitution is due to a spurious error in the mutagenic oligonucleotide), 
in addition, the tight-binding hGH variant described in example was introduced by exchanging 
a fragment from pDM041 1 (example XIII) The resulting library vector was designated 
DDM0454. 

5 

Preparation of the library cassette vector ana insertion of the mutagenic 
cassette 

To introduce a library cassette, pDM0454 was digested with Apai followed by Sail, then 
precipitated with 13% PEG 8000+ 10mM MgCfc, washed twice in 70% ethanol and 

1 0 resuspended This efficiently precipitates the vector but leaves the small Apa-Sal fragment in 
solution (Paiihankar, K. R. and Prasad, K. S. N., Nucleic Acids Research 13:1346). The 
product was run on a 1 % agarose gel and the Apal-Sall digested vector excised, purified using 
a Bandprep kit (Pharmacia) and resuspended for ligation with the mutagenic cassette. 

The cassette io be inserted contained a DNA sequence similar to thai in the iinker 

1 5 region of pS0640 and pDM041 1 , but with the codons for the histidine and tyrosine residues in 
the substrate sequence replaced by randomised codons. We chose to substitute NNS 
(N=G/A/T/C; S=G/'C) ai each of the randomised positions as described in exampie VIII. The 
oligonucleotides used in the mutagenic cassettes were: 5-C-TTC-GCT-GCT-NNS-NNS-ACC- 
CGG-GAA-3' (coding strand) and B'-T-CGA-TTG-CCG-GGT-SNN-SNN-AGC-AGG-GAA-QQQ- 

20 CC-3 : (non-coding strand). This cassette aiso destroys the Saii site, so that digestion with Saii 
may be used to reduce the vector background. The oligonucleotides were not 
phosphorylated before insertion into the Apa-Sa! cassette site, as it was feared that 
subsequent oiigomerisaiion of a small population of the cassettes may iead to spurious results 
with multiple cassette inserts. Following annealing and ligation, the reaction products were 

25 phenokchloroform extracted, ethanol precipitated and resuspended in water. Initially, no 

digestion with Sail to reduce the background vector was performed. Approximately 200ng was 
electroporated into XL-1 blue cells and a phagemid library was prepared as described in 
example VIII. 

30 Selection of highly cleayable substrates from the substrate library 

The selection procedure used was identical to that described for pDM0411 and 
pDM0390 in exampie XIII. After each round of selection, the eluted phage were propagated by 
transducing a fresh culture of XL-1 blue cells and propagating a new phagemid library as 
described for hGH-phage in example VIII. The progress of the selection procedure was 
35 monitored by measuring eluted phage titres and by sequencing individual clones after each 
round of selection. 

Table A shows the successive phage titres for eiution in the presence and absence of 
enzyme after 1 , 2 and 3 rounds of selection. 
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Clearly, the ratio of specif icaiiy eiuted phage: non-specificaiiy eiuted phage (ie phage eiuted 
with enzyme:phage eiuted without enzyme) increases dramatically from round 1 to round 3, 
suggesting that the population of good substrates is increasing with each round of selection. 
Sequencing of 10 isolates from the starting library showed them all to consist of the 
5 wild-type pDM0464 sequence. This is attributed to the fact that after digestion with Apai, the 
Sail site is very close to the end of the DNA fragment, thus leading to low efficiency of 
digestion. Nevertheless, there are only 400 possible sequences in the library, so this 
population should still be well represented. 

Tables B1 and B2 shows the sequences of isolates obtained after round 2 and round 

10 3 of selection. After 2 rounds of selection, there is clearly a high incidence of histidine 
residues. This is exactly what is expected: as described in example XIII, A64SAL subtilisin 
requires a histidine residue in the substrate as it employs a substrate-assisted catalytic 
mechanism. After 3 rounds of selection, each of the 10 clones sequenced has a histidine in 
the randomised cassette. Note, however, that 2 of the sequences are of pDM041 1 , which was 

1 5 not present in the starting library and is therefore a contaminant. 
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Tanie A 

Titration of initial phage pools and eluted phage from 3 rounds of selective 
enrichment 



Colony forming units (cfu) were estimated by plating out 10ul of 10-fold dilutions of phage on 
lOui spots of XL-1 blue ceils, on LB agar plates containing 50ug/mi carbeniciilin 

ROUND 1 



Starting library: 

LIBRARY: 

pDM0411: 
(control) 

ROUND 2 

Round 1 library: 

LIBRARY: 



pDM0411: 
(control) 



Round 2 library: 

LIBRARY: 



pDM0411: 
(control) 



3x10 12 cfu/ml 

+500nM A64SAL 
no enzyme 

+500nM A64SAL 
no enzyme 



7x1 0 1 2 cfu/ml 

+500nM A64SAL 
no enzyme 

+500nM A64SAL 
no enzyme 



7x1 0 11 cfu/m! 

+500nM A64SAL 
no enzyme 

+500nM A64SAL 
no enzyme 



4x1 0 3 cfu/10ul 
3x1 0 3 cfu/10ul 

2x1 0 6 cfu/10u.l 
8x1 0 3 cfu/iOuj 



3x1 0 4 cfu/10ul 
6x1 0 3 cfu/10ul 

3x1 0 6 cfu/10ul 
1.6x10 4 cfu/lOul 



1x105cfu/10ul 
<10 3 cfu/10ul 

5x1 0 6 cfu/10ul 
3x10 4 cfu/10u.l 
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Table B1 

Sequences of eluted phage after 2 rounds of selective enrichment. 

5 All protein sequences should be of the form AA**TRQ, where * represents a 

randomised codon. in the table below, the randomised codons and amino acids are underlined 
and in bold. 
After round 2; 



A A fi 2 " T R Q 
GCT GOT car. TAC ACC CGG CAA 



A A fl H T R Q 

GCT GCT CAC ATG ACC CGG CAA 



A A i tt T R Q 

GCT GCT CVTCi CAC ACC CGG CAA 



A A 1 H T R Q 
GCT GCT rVTG CAC ACC CGG CAA 



a a a i r q 

GCT GCT CAC ACC CUCi CAA 



A A 2 a T R Q 
GCT GCT CAC ACC CGG CAA 



. . wild-type pDM0454 



# - spurious deletion of 1 codon within the cassette 
## - ambiguous sequence 
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Tabie B2 

Sequences of siutsd phase after 3 rounds of selective enrichment. 

5 AH protein sequences shouid be of the form AA"TRQ, where 4 represents a 

randomised codon. In the table below, the randomised codons and amino acids are underlined 
and in bold. 

Alter round 3; 

10 

Sequence No. of 

occurrences 

A A fl 2 T R Q 

15 , , , GCT GCT CAC TAT ACG CGT CAG ... 2 # 



A A i fl T R Q 

GCT GCT CTC CAC ACC CGG CAA 



A A fl H T R Q 
GCT GCT CAG CAC ACC CGG CAA 



A A X H T R Q 
GCT GCT ACG CAC ACC CGG CAA 



A A fl fi R Q 

. . . GCT GCT CAC TCC CGG CAA 



A A S £ T R Q 
GCT GCT CAT CAT ACC CGG CAA 



A A H £ R Q 

GCT GCT CAC TTC CGG CAA 



A A fl I R Q 

GCT GCT CAC ACC CGG CAA 



contaminating sequence from pDM0411 

contains the "illegal" codon CAT - T should not appear in the 3rd position of a 
codon. 



45 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Genentech, Inc. 

Garrard. Lisa J. 
Henner, Dennis J. 
Bass, Steven 
Greene , Ronald 
Lowman, Henry B. 
Wells, James A. 
Matthews, David J. 

(ii) TITLE OF INVENTION: Enrichment Method For Variant Proteins With 
Altered Binding Properties 

(iii) NUMBER OF SEQUENCES: 27 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genentech, Inc. 

(B) STREET: 460 Point San Bruno Blvd 

(C) CITY: south San Francisco 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 34080 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 5.25 inch, 360 Kb floppy disk 

(B) COMPUTER i IBM PC compatible 

(O) OPERATING SYSTEM: PC-DOS /MS— DOS 
(D) SOFTWARE: patin (Genentech) 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 03-DEC-91 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/743614 

(B) APPLICATION DATE: 09-Aug-91 

fviil PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/715300 

(B) APPLICATION DATE: 14-June-91 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/683400 

(B) APPLICATION DATE: iU-Apr-yl 

(vLL\ PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/621667 

(B) APPLICATION DATE: OJ-Dec-lyyu 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME': Benson, Robert H. 

(B) REGISTRATION NUMBER: 3U,446 

(C) REFERENCE /DOCKET NUMBER: 645P4 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415/266-1489 

(B) TELEFAX: 415/952-9881 

(C) TELEX: 910/371-7168 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 
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(C) STRANDED NESS : single 

(D ) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOsls 

5 

GGCAGCTGTG GGtTGTAGAG TGGCGGCGGC TCTGGT 36 



10 (2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

20 

AGCTGTGGCT TCGGGCCCTT AGCATTTAAT GCGGTA 36 



(2) INFORMATION FOR SEQ ID NO: 3: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



35 TTCACAAACG AAGGGCCCCT AATTAAAGCC AGA 33 



(2) INFORMATION FOR SEQ ID NO: 4: 

40 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 



CAATAATAAC GGGCTAGCCA AAAGAACTGG 30 

50 

(2) INFORMATION FOR SEQ ID NO; 5: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 24 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



CACGACAGAA TTCCCGACTG GAAA 24 

65 

(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 23 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID NO:6i 
CTGTTTCTAG AGTGAAATTG TTA 23 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ACATTCCTGG GTACCGTGCA G 21 

{2} INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOs8: 

GCTTCAGGAA GGACATC-GAC NNSGTCNNSA CANNSCTGNN SATCGTGCAG 50 
TGCCGCTCTG TGG 63 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
fA) LENGTH: 24 bases 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
AAGGTCTCCA CATACCTGAG GATC 24 

(2) INFORMATION FOR SEQ ID NOslO; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPEs nucleic acid 
fCI STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



ATGGACAAGG TGTCGACATA CCTGCGCATC GTG 33 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) Ti'FE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; lis 

GGCAGCTGTG GCTTCTAGAG TGGCGGCGGC TCTGGT 36 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 36 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GGCAGCTGTG GATTCTAGAG TGGCGGTGGC TCTGGT 36 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Gly Ser Cys Gly Phe Glu Ser Gly Gly Gly Ser Gly 
1 5 10 12 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CGGACTGGGC AGATATTCAA GCAGACC 27 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CTCAAGAACT ACGGGTTACC CTGACTGCTT CAGGAAGG 38 
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(2) INFORMATION FOR SEQ ID NO: 16: 

5 (i) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CGCATCGTGC AGTGCAGATC TGTGGAGGGC 30 

15 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTHS 66 bases 

(Bj TYPE : nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



GTTACTCTAC TGCTTTCAGG AAGGACATGG ACNNSGTCNN SACANNSCTCi au 

30 

NNSATCGTGC AGTGCA 66 



(2) INFORMATION FOR SEQ ID NO: 18: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 64 bases 

(B) TYPE? nucleic acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



45 GATCTGCACT GCACGATSNN CAGSNNTGTS NNGACSNNGT CCATGTCCTT 50 



CCTGAAGCAG TAGA 64 

50 

(2) INFORMATION FOR SEQ ID NOsl9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
55 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOFOLOGYs linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

60 

GCCTTTGACA GGTACCAGGA GTTTG 25 



65 (2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 bases 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



CCAACTATAC CACTCTCGAG GTCTATTCGA TAA 33 



10 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 66 bases 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

20 

TCGAGGCTCN NSGACAACGC GNNSCTGCGT GCTNNSCGTC TTNNS CAGCT 50 



25 GGCCTTTGAC ACGTAC 66 



(2) INFORMATION FOR SEQ ID NO: 22: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 22: 



GTGTCAAAGG CCAGCTGSNN AAGACGSNNA GCACGCAGSN NCGCGTTGTC 50 

40 

SNNGAGCC 58 



45 (2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 65 bases 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

55 

GTTACTCTAC TGCTTCNNSA AGGACATGNN SAAGGTCAGC NNSTACCTGC 50 



GCNNSGTGCA GTGCA 65 

60 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
65 (A) LENGTH: 64 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

GATCTGCACT GCACSNNGCG CAGGTASNNG CTGACCTTSN NCATGTCCTT 50 
SNNGAAGCAG TAGA 64 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2178 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO s 2 5 » 

ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50 

TGCTACAAAC GCGTACGCTG ATATCCAGAT GACCCAGTCC CCGAGCTCCC 100 

TGTCCGCCTC TGTGGGCGAT AGGGTCACCA TCACCTGCCG TGCCAGTCAG 150 

GATGTGAATA CTGCTGTAGC CTGGTATCAA CAGAAACCAG GAAAAGCTCC 200 

GAAACTACTG AtTTACTCGG CATCCTTCCT CTACTCTGGA GTCCCTTCTC 250 

GCTTCTCTGG ATCCAGATCT GGGACGGATT TCACTCTGAC CATCAGCAGT 300 

CTGCAGCCGG AAGACTTCGC AACTTATTAC TGTCAGCAAC ATTATACTAC 350 

TCCTCCCACG TTCGGACAGG GTACCAAGGT GGAGATCAAA CGAACTGTGG 400 

CTGCACCATC TGTCTTCATC TTCCCGCCAT CTGATGAGCA GTTGAAATCT 450 

GGAACTGCCT CTGTTGTGTG CCTGCTGAAT AACTTCTATC CCAGAGAGGC 500 

CAAAGTACAG TGGAAGGTGG ATAACGCCCT CCAATCGGGT AACTCCCAGG 550 

AGAGTGTCAC AGAGCAGGAC AGCAAGGACA GCACCTACAG CCTCAGCAGC 600 

ACCCTGACGC TGAGCAAAGC AGACTACGAG AAACACAAAG TCTACGCCTG 650 

CGAAGTCACC CATCAGGGCC TGAGCTCGCC CGTCACAAAG AGCTTCAACA 700 

GGGGAGAGTG TTAAGCTGAT CCTCTACGCC GGACGCATCG TGGCCCTAGT 750 

ACGCAAGTTC ACGTAAAAAG GGTATCTAGA GGTTGAGGTG ATTTTATGAA 800 
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AAAGAATATC GCATTTCTTC TTGCATCTAT GTTCGTTTTT TCTATTGCTA 850 
CAAACGCGTA CGCTGAGGTT CAGCTGGTGG AGTCTGGCGG TGGCCTGGTG 900 

5 

CAGCCAGGGG GCTCACTCCG TTTGTCCTGT GCAGCTTCTG GCTTCAACAT 950 
10 TAAAGACACC TATATACACT GGGTGCGTCA GGCCCCGGGT AAGGGCCTGG 1000 

AATGGGTTGC AAGGATTTAT CCTACGAATG GTTATACTAG ATATGCCGAT 1050 

15 

AGCGTCAAGG GCCGTTTCAC TATAAGCGCA GACACATCCA AAAACACAGC 1100 
CTACCTGCAG ATGAACAGCC TGCGTGCTGA GGACACTGCC GTCTATTATT 1150 

20 

GTTCTAGATG GGGAGGGGAC GGCTTCTATG CTATGGACTA CTGGGGTCAA 1200 
25 GGAACCCTGG TCACCGTCTC CTCGGCCTCC ACCAAGGGCC CATCGGTCTT 1250 

CCCCCTGGCA CCCTCCTCCA AGAGCACCTC TGGGGGCACA GCGGCCCTGG 1300 

30 

GCTGCCTGGT CAAGGACTAC TTCCCCGAAC CGGTGACGGT GTCGTGGAAC 1350 
TCAGGCGCCC TGACCAGCGG CGTGCACACC TTCCCGGCTG TCCTACAGTC 1400 

35 

CTCAGGACTC TACTCCCTCA GCAGCGTGGT GACTGTGCCC TCTAGCAGCT 1450 
40 TGGGCACCCA GACCTACATC TGCAACGTGA ATCACAAGCC CAGCAACACC 1500 

AAGGTGGACA AGAAAGTTGA GCCCAAATCT TGTGACAAAA CTCACACAGG 1550 

45 

GCCCTTCGTT TGTGAATATC AAGGCCAATC GTCTGACCTG CCTCAACCTC 1600 
CTGTCAATGC TGGCGGCGGC TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG 1650 

50 

GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG GGTGGCGGCT CTGAGGGAGG 1700 
55 CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT GAAAAGATGG 1750 

CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA 1800 

60 

CAGTCTGACG CTAAAGGCAA ACTTGATTCT GTCGCTACTG ATTACGGTGC 1850 
TGCTATCGAT GGTTTCATTG GTGACGTTTC CGGCCTTGCT AATGGTAATG 1900 

65 

GTGCTACTGG TGATTTTGCT GGCTCTAATT CCCAAATGGC TCAAGTCGGT 1950 
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GACGGTGATA ATTCACCTTT AAT6AATAAT TTCCGTCAAT ATTTACCTTC 2000 
CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTAGC GCTGGTAAAC 2050 
CATATGAATT TTCTATTGAT TGTGACAAAA TAAACTTATT CCGTGGTGTC 2100 
TTTGCGTTTC TTTTATATGT TGCCACCTTT ATGTATGTAT TTTCTACGTT 2150 
TGCTAACATA CTGCGTAATA AGGAGTCT 2178 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
(&) LENGTH: 237 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Set Met Fhe Vai Phe 
1 5 10 15 

Ser He Ala Thr Asn Ala Tyr Ala Asp He Gin Met Thr Gin Ser 
20 25 30 

Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr He Thr 
35 40 45 

Cys Arg Ala Ser Gin Asp Val Asn Thr Ala Val Ala Trp Tyr Gin 
50 55 60 

Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu He Tyr Ser Ala Ser 
1 65 70 75 

Phe Leu Tyr Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Arg Ser 



x>h« n a Thr Tvr Tvr Cvs Gin Gin His Tvr Thr Thr Pro Pro Thr 
J 110 " 115 120 

Phe Gly Gin Gly Thr Lys Val Glu He Lys Arg Thr Val Ala Ala 
125 130 135 

Pro Ser Val Phe He Phe Pro Pro Ser Asp Glu Gin Leu Lys Ser 
140 145 150 

Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg 
155 160 165 

Glu Ala Lys Val Gin Trp Lys Val Asp Asn Ala Leu Gin Ser Gly 
170 175 180 

Asn Ser Gin Glu Ser Val Thr Glu Gin Asp Ser Lys Asp Ser Thr 
185 190 195 

Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu 

200 205 210 

Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gin Gly Leu Ser 
215 220 225 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Met Lvs Lvs Asn He Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
15 10 15 

Ser He Ala Thr Asn Ala Tyr Ala Glu Val Gin Leu Val Glu Ser 
20 25 30 

Gly Gly Gly Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys 
35 40 45 

Ala Ala Ser Gly Phe Asn He Lys Asp Thr Tyr He His Trp Val 
50 55 60 

Arq Gin Ala Pro Glv Lvs Glv Leu Glu Tro Val Ala Ara He Tvr 
65 70 75 

Pro Thr Asn Gly Tyr Thr Arg Tyr Ala Asp Ser Val Lys Gly Arg 
80 85 90 

Phe Thr He Ser Ala Asp Thr Ser Lys Asn Thr Ala Tyr Leu Gin 
95 100 105 

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ser 
110 115 120 



Gly Thr Leu Val Thr Val ser Ser Ala Ser Thr Lys Gly Pro Ser 
140 145 150 

Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr 
155 160 165 

Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val 

170 175 180 

Thr Val Ser Trp Asn Ser Glv Ala Leu Thr Ser Glv Val His Thr 
185 190 195 

Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu Ser Ser 
200 205 210 

Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr Tyr He 
215 220 225 



Val Glu Pro Lys Ser Cys Asp Lvs Thr His Thr Glv Pro Phe Val 
245 250 255 
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Glv Gly Gly ser Giu Giy Giy Gly Ser Glu Gly Gly Gly Ser Glu 
290 295 300 

Gly Gly Gly Ser Gly Gly Gly Ser Gly Ser Gly Asp Phe Asp Tyr 
305 310 315 

Glu Lye Met Ala Asn Ala Asn Lys Gly Ala Met Thr Glu Asn Ala 
320 325 330 

Asp Glu Asn Ala Leu Gin Ser Asp Ala Lys Gly Lys Leu Asp Ser 
335 340 345 

Val Ala Thr Asp Tyr Gly Ala Ala lie Asp Gly Phe He Gly Asp 
350 355 360 

Val Ser Gly Leu Ala Asn Gly Asn Gly Ala Thr Gly Asp Phe Ala 
365 370 375 

Gly Ser Asn Ser Gin Met Ala Gin Val Gly Asp Gly Abp Asn Ser 
380 385 390 

Pro Leu Met Asn Asn Phe Arg Gin Tyr Leu Pro Ser Leu Pro Gin 
395 400 405 

ser val Glu Cys Arg Pro Phe Val Phe Ser Ala Gly Lys Pro Tyr 
410 415 420 

Glu Phe Ser lie Asp Cys Asp Lys He Asn Leu Phe Arg Gly Val 
425 430 435 

Phe Ala Phe Leu Leu Tyr Val Ala Thr Phe Met Tyr Val Phe Ser 
440 445 4s0 

Thr Phe Ala Asn lie Leu Arg Asn Lys Glu Ser 
455 460 461 
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What is claimed is: 

1. A method for selecting novei binding polypeptides comprising: 

5 (a) constructing a replicable expression vector comprising 

a transcription regulatory element operabiy iinked to 
a gene fusion encoding a fusion protein wherein the gene fusion comprises 
a first gene encoding a polypeptide, and 
a second gene encoding at least a portion of a 
1 o phage coat protein; 

( b ) mutating the vector at one or more selected positions within the first gene thereby forming a 

family of related plasmids; 

(c ) transforming suitable host cells with the plasmids; 

15 

( ri ) infecting the transformed host cells with a helper phage 
having a gene encoding the phage coat protein; 

(e) culturing the transformed infected host cells under conditions 
20 suiiabie for forming recombinant phagernid particles containing at least a portion of 

the plasmid and capable of transforming the host, the conditions adjusted so that no 
more than a minor amount of phagernid particles display more than one copy of the 
fusion protein on the surface of the particle; 

25 (?) contacting the phagernid particles with a target molecule so that at least a portion of the 

phagernid partides bind io the target molecule; and 

( g ) separating the phagernid particles that bind from those that do not. 

30 2. The method of claim 1 further comprising infecting a suitable host cells with the phagernid particles that 
bind and repeating steps (d) through (g). 

3. The method of claim 2 wherein the steps are repeated one or more times. 

35 4 . The method of claim 1 wherein the expression vector further comprises a secretory signal sequence. 



5. The method of claim 1 wherein the transcription regulatory element is a promoter system selected 
from the group; /acZ, pbo A, tryptophan, tac, Xpl, bacteriophage T7, and combinations 
thereof. 
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6. The method of claim 1 wherein the first gene encodes a mammalian protein. 

7. The method of ciaim 6 wherein the protein is selected from the group; 

5 growth hormone, human growth hormone(hGH), des-N-methionyi human growth hormone, 

bovine growth hormone, parathyroid hormone, thyroxine, insulin A-chain, insulin B-chain, 
proinsuiin, reiaxin A-chain, reiaxin B-chain, prcrelaxin, follicle stimulating hormone(FSH), 
thyroid stimulating hormone(TSH), leutinizing hormone(LH), glycoprotein hormone receptors, 
calcitonin, glucagon, factor VIII, an antibody, lung surfactant, urokinase, streptokinase, human 

1 o tissue-type plasminogen activator (t-PA). bombesin, factor IX, thrombin, hemopoietic growth 

factor, tumor necrosis factor-alpha and -beta, enkephalinase, human serum albumin, muiierian- 
inhibiting substance, mouse gonadotropin-associated peptide, 3-kctamase, tissue factor 
protein, inhibin, acfivin, vascular endothelial growth factor, integrin receptors, mrombopoietJn, 
protein A or D. rheumatoid factors. NGF-p, platelet-growth factor, transforming growth 

1 5 factor . jQF-aipha and TGF-beta, insulin-like growth factor-l and -II, insulin-like growth 

factor binding proteins , CD4, DNase, latency associated peptide, erythropoietin, HER2 
ligands, osteoinductive factors, interferon-aipha. -beta, and -gamma, colony stimuiaiing 
factors (CSFs), M-C-SF, GM-GSF, and G-CSF, interleukins (ILs), IL-1, IL-2, IL-3, IL-4, 
superoxide dismutase; decay accelerating factor, viral antigen, HIV envelope proteins GP12Q 

20 and GP140, atrial natriuretic peptides A, B, or C, or immuno globulins, and fragments of the 

above-listed proteins. 

8. The method of claim 7 wherein the protein is a human protein. 

25 9. The method of claim 8 wherein the protein comprises more than about 100 amino acid residues. 

10. The method of claim 1 wherein the protein comprises a plurality of rigid secondary structures displaying 

amino acids capable of interacting wiih the target, and ihe mutations are primarily produced 
at positions corresponding to codons encoding the amino acids. 

30 

11. The method of claim 10 wherein the rigid secondary structures comprise structures selected from the 

group: a-{3.6i3)helix, 3io helix, jr-{4.4i6)helix, parallel and anti-parallel p-pleated sheets, 
reverse turns, and non-ordered structures. 

35 19. The metncd of daim 10 wherein tf^^ 

13. The method of ciaim 12 wherein the mutations are produced on more than one rigid secondary structure. 
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14. The method of claim 1 wherein the helper phage is selected from the group M13K07, M13R408, M13- 

VCS,andPhiX174. 

15. The method of claim 14 wherein the helper phage is M13K07 and the coat protein is the M13 phage gene 

Hi coat protein. 

5 

16. The method of claim 15 wherein the host is Eooff. 

17. The method of claim 16 wherein the piasmid is under tight control of the transcription regulatory 



i he method of ciaim 17 wherein the amount is iess ihan about i%. 

The method of claim 18 wherein the amount is less than 20% the amount of phagemid particles displaying 
a single copy of the fusion protein. 

The method of daim 1 9 wherein the amount is less than 1 0%. 

The method of claim 1 further comprising in step (a), inserting a DNA triplet, encoding an mRNA 
suppressive terminator codon between said first gens encoding a polypeptide, and said 
second gene encoding at least a portion of a phage coat protein. 

The method of claim 21 wherein said mRNA suppressive terminator codon is selected from the 
following: UAG (amber), UAA tocher) and UGA (opel). 

The method of claim 22 wherein said suppressive mutation results in the detectable production of a 
fusion polypeptide containing sadi polypeptide and said coat protein when said expression 
vector is grown in a suppressor host cell; and, when grown in a non-suppressor host cell said 
polypeptide is synthesized substantially without fusion to said phage coat protein. 

A human growth hormone variant wherein hGH amino acids 1 72, 1 74, 1 76 and 1 78 respectively are as a 
group sequentially selected from one of the following: (1)R I S,F,R; (2)R,A,Y,R; (3)K,T,Y,K; 
(4)R,S,Y,R; (5)K,A,Y,R; (6)R,F,F,R; (7)K,Q,Y,R; (8) R.T.Y.H; (9)Q,R,Y,R; (10)K,K,Y,K; 
(IIIRAKS: and (12)K,S,N,R. 

A phagemid comprising a replicable expression vector comprising a transcription regulatory element 
operaWy linked to a gene fusion encoding a fusion protein wherein the gene fusion comprises a 
first gene encoding a polypeptide, and a second gene encoding at ieast a portion of a phage 
coat protein, wherein a DNA triplet codon encoding an mRNA suppressive terminator codon 
selected from UAG, UAA and UGA is inserted between the fused ends of the first and second 
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genes, or is substituted for an amino add encoding triplet codon adjacent to the gene fusion 
junction. 

26. The phagemid of a'aim 25 wherein said first gene encodes a mammalian protein. 

5 

27. The phagemid of dam 26 wherein the protein is selects from the group: growth hormone, human 

growth hormone (hGH), des-N-meihionyi human growth hormone, bovine growth hormone, 
parathyroid hormone, thyroxine, insulin A-chain, insulin B-chain, proinsulin, relaxin A-chain, 
rslaxin B-chain, proreSaxin, follicle simulating hormone (FSH). thyroid stimulating hormone 

1 o (TSH), leutinizing hormone (LH), glycoprotein hormone receptors, calcitonin, glucagon, factor 

VIII, an antibody, lung surfactant, urokinase, streptokinase, human tissue-type plasminogen 
activator (t-PA), bombesin, factor IX, thrombin, hemopoietic growth factor, tumor necrosis 
factor-alpha and-beia, enkephaiinase, human serum albumin, mulierian-inribiting substance, 
mouse gonadotropin-assodated peptide, p-lactamase, tissue factor protein, inhibin, activin, 

i s vascular endothelial growth factor, integrin receptors, thrombopoietin. protein A or D, 

rheumatoid factors, NGF-p, platelet-growth factor, transforming growth factor; TGF-alpha 
and TGF-beta, insulin-like growth-l and -II, insulin-like growth factor binding proteins, CD-4, 
DNase, latency associated peptide, erythropoietin, osteoinductive factors, interferon-alpha, - 
beta, and -gamma, coiony siimuiaiing factors (CSFs), M-CSF, GM-CSF, and G-CSF, 

20 interieukins (ILs), IL-1 , IL-2, IL-3, IL-4, superoxide dismutase; decay accelerating factor, viral 

antigen, HIV envelope proteins GP12D and GP14Q, atrial natriuretic peptides A, B or C 
immuno globulins, and fragments of the above-iisted proteins. 

28. The phagemid of claim 27 wherein said protein is a human protein, 

25 

29. The phagemid of claim 28 wherein the protein comprises more than about 100 amino acid residues. 

30. The phagemid of daim 25 wherein said protein comprises a plurality of rigid secondary structures 

displaying amino adds capable of interacting with the target 

30 

31. The phagemid of daim 30 wherein said rigid secondary structures comprises structures selected from 

the group; a-(3.6i3)helix, 3in helix, jr-(4.4i 6 )helix, parallel and anti-parallel p-pteated 
sheets, reverse turns, and non-ordered structures. 

35 32. The phagemid of daim 25 wherein the helper phage is selected from the group M13K07, M13R408, 
M13-VCS,andPhiX174. 



The phagemid of claim 32 wherein the helper phage is M1 3K07 and the coat protein is the M1 3 phage 
gene Hi coat protein. 
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The phagemid of ciaim 33 wherein {he host is the £ coB wild type or suppressor type. 

The phagemid of claim 34 wherein the plasmid is under tight control of the transcription regulatory 

element. 

The phagemid of claim 35 wherein the number of phagemid particles displaying more than one copy of 
the fusion proiein on the surface of the particles is less than 1%. 

The phagemid of claim 36 wherein said number of phagemid particles is less than about 1 0%, 

The phagemid of claim 37 wherein the number of phagemid particles is less than about 20%. 

A human growth variant wherein hGH amino acids 10, 14,18, and 21 respectively are as a group 
sequentially selected from one of the following: 
{1)H,G,N,N; (2)A,W,D,N; (3)F,S,F,L; (4>Y,T,V,N and {5)!,N,!,N. 

A human growth variant wherein hGH amino acids 174 is serine and 176 is tyrosine and hGH amino acids 
167, 171, 175 and 179 respectively are as a group sequentially selected from one of the 
following: 

(1)N,S,T,T; (2)E,S,T,I; (3)K,S.T,L; (4)N,N,T,T; (5) R,D,I,I; and (6)N,S,T,Q. 

A method for selecting novei binding polypeptides comprising 

( a ) constructing a replicable expression vector comprising a 

transcription regulatory element operably linked to DNA encoding a protein of 
interest containing one or more subunits, wherein the DNA encoding at least one of 
the subunits is fused to the DNA encoding at least a portion of a phage coat protein; 

( b ) mutating the DNA encoding the protein of interest at one or more selected positions thereby 

forming a family of related vectors; 

(c ) transforming suitable host cells with the vectors; 

( d ) infecting the transformed host ceils with a helper phage having a gene encoding the phage 



cuituring the transformed infected host ceiis under conditions suitable for forming 

recombinant phagemid particles containing at least a portion of the plasmid and 
capable of transforming the host, the conditions adjusted so that no more than a 
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minor amount of phagemid parades display more than one copy of the fusion protein 
on the surface of the particle; 

( f ) contacting the phagemid particles with a target molecule so that at least a portion of the 

phagemid particles bind to the target molecule; and 

(g ) separating the phagemid parades that bind from those thai do not 

The method of claim 41 wherein the expression vector further comprises a secretory signal sequence 
operably linked to the DNA encoding each subunit of the protein of interest. 



43 . The method of claim 42 wherein, the protein of interest is a mammalian protein. 

44 . The method of daim 43 wherein the protein of interest is selected from the group; 

1 5 insulin, relaxin, fcllide stimulating hormone (FSH), thyroid stimulating hormone (TSH), 

leutinizing hormone (tH), glycoprotein hormone receptors, monodonai and polyclonal 
antibodies, lung surfactant, integrin receptors, insulin-like growth factor-l and -II, and 
fragments of the above-listed proteins. 

20 45. The method of daim 44 wherein the protein of interest is a humanized antibody. 

46 . The method of claim 45 wherein the protein of interest is a humanized Fab fragment capable of binding 
to the HER-2 receptor (human epidermal growth fador receptor-2). 

25 47 A human growth hormone (hGH) variant wherein hGH amino acid glutamatei74 is replaced by serinei74 
and phenylalaninei76 is replaced by tyroslnei76 and one or more of the eight naturally 
occurring hGH amino adds FtO, M14, H18, H21 , R167, D171, T175 and 1179 are replaced by 
another natural amino add. 

30 48 The hGH variant of daim 47 wherein the eight naturally occurring hGH amino adds F10, M14, Hi 8, H21 , 
R167, D171 , T175 and 1179 respectively are as a group replaced with a corresponding amino 
add sequentially selected from one of the following groups: 



(I) H r G, N, N, N, S f T, T; 
(3) H, G r N, N, N, N, T, T; 
(5) A, W, D, N, E, S, T r I; 
(7) F, S, F, L, N, S, T, T; 
(9) F f S, F r - L, N, N, T, T. 

(II) A, N, D, A r N, N, T, N; 
(13) H, Q, T, S, A, D, K, S; 
(15) F, S, F, L r S, D, T, T; 
(17) Q r Y, N r N r H r S, T, T; 
(19) F f L, S, S, K, N, T r V; 



(2) H r G, N, N, E r S, T f I; 

(4) A r W r D r N, N, S f T r T; 

(5) A r W, D r N, N, N, T, T; 
(8) F r S, F r - L, E, S,- T, I; 
(10) H, G, N, N, N, S, T r N; 
(12) F, S, F r G, H, S, T r T; 
(14) H, G, N r N, N, A, T, T; 
(16) A, S, T r N, R, D, T, I; 
(18) K, G, S, S, R, D, T r I; 
(2Q) W f vs r N f S, H,- S, T, T; 
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(21) A, N, A, S, N, S, T, T; 

(23) H, G, N, N, N, N, T, S; 

(25) M, T, S, N, Q, S, T, T; 

(27) A, W, D, N, R# D, T, I; 

(29) M, Q, M, N, N, S, T, T; 

(31) L, N, S, H, R, D, T, I; 

(33) A, W, D, N, N, A, T, x; 

(35) A, W, D, N, R, D, T, I; 

(37) F f S, L,- A. N, S, T, V; 

(39) M, A, D, N, N, S, T, T; 

(41) H, Q, Y, S, R, D, T, I. 



79 

(22) P, S, D, N, R, D, T, I; 

(24) F, S, X, G, R, D, T, I; 

(26) F, S, F, L, T, S, T, S; 

(28) A, W, D, N, H, S, T, N; 

(30) H, Y, D, H, R, D, T, T; 

(32) L f N, S, H, T, S, T, T; 

(34) F, S, T, G, R, D, T, I; 

{35} I, Q, E, H, N, S, T, T; 

(38) F, S, F. L. K. D, T, T; 

(40) A, W, D, N, S, S, V, T; 



43. The method of claim 48 wherein said human growth hormone variant {1 1} further contains leucinei 5 replaced 
by argininei 5 and lysinei 68 replaced by argininei68- 



1. The method of claim 48 wherein said human growth hormone variant (40) further contains phenylalanine 1 76 



51 . A method for selecting novei binding polypeptides comprising: 



20 (a) constructing a repiicable expression vector comprising 

a transcription regulatory eiement operabiy iinked to 

a gene fusion encoding a fusion protein wherein the gene fusion comprises 

a firs! gene encoding a polypeptide operable connedted to a linking amino 

acid sequence, and 

25 a second gene encoding at least a portion of a 

phage coat protein; 



( b ) mutating the vector at one or more selected positions within the amino acid linking sequence of 

the first gene thereby forming 3 family of related plasmids; 

30 

( c ) transforming suitable host cells with the plasmids; 



( d ) infecting the transformed host cells with a helper phage 

having a gene encoding the phage coat protein; 

(e) culturing the transformed infected host cells under conditions 

suitable for forming recombinant phagemid particles containing at least a portion of 
the piasmid and capable of transforming the host, the conditions adjusted so that no 
more than a minor amount of phagemid particles display more than one copy of the 
fusion protein on the surface of Ihe particle; 



(f) 



contacting the phagemid particles with a target molecule so that at least a portion of the 
phagemid particles bind to the target molecule; and 



WO 92/09690 PCI7US91/09I33 



(g ) contacting the bound phagemid particies with a protease capable of hydroiysing the linking a 
aminio acid sequence of at least a portion, of the bound phagmid particies, and 

5 (h) isolating the hydrolyzed phagmid particles. 

52. The method of claim 51 further comprising infecting suitable host ceiis with the hydrolyzed phagemid 
particles and repeating steps (d) through (h). 
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